<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xml:lang="EN" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">Front. Anim. Sci.</journal-id>
<journal-title>Frontiers in Animal Science</journal-title>
<abbrev-journal-title abbrev-type="pubmed">Front. Anim. Sci.</abbrev-journal-title>
<issn pub-type="epub">2673-6225</issn>
<publisher>
<publisher-name>Frontiers Media S.A.</publisher-name>
</publisher>
</journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3389/fanim.2021.703380</article-id>
<article-categories>
<subj-group subj-group-type="heading">
<subject>Animal Science</subject>
<subj-group>
<subject>Original Research</subject>
</subj-group>
</subj-group>
</article-categories>
<title-group>
<article-title>Time-Consuming, but Necessary: A Wide Range of Measures Should Be Included in Welfare Assessments for Dairy Herds</article-title>
</title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Collins</surname> <given-names>Sophie</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1377446/overview"/>
</contrib>
<contrib contrib-type="author" corresp="yes">
<name><surname>Burn</surname> <given-names>Charlotte C.</given-names></name>
<xref ref-type="corresp" rid="c001"><sup>&#x0002A;</sup></xref>
<uri xlink:href="http://loop.frontiersin.org/people/204523/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Wathes</surname> <given-names>Christopher M.</given-names></name>
</contrib>
<contrib contrib-type="author">
<name><surname>Cardwell</surname> <given-names>Jacqueline M.</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/578465/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Chang</surname> <given-names>Yu-Mei</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/817595/overview"/>
</contrib>
<contrib contrib-type="author">
<name><surname>Bell</surname> <given-names>Nicholas J.</given-names></name>
<uri xlink:href="http://loop.frontiersin.org/people/1149158/overview"/>
</contrib>
</contrib-group>
<aff><institution>Production and Population Health, Royal Veterinary College</institution>, <addr-line>Hertfordshire</addr-line>, <country>United Kingdom</country></aff>
<author-notes>
<fn fn-type="edited-by"><p>Edited by: Harry J. Blokhuis, Swedish University of Agricultural Sciences, Sweden</p></fn>
<fn fn-type="edited-by"><p>Reviewed by: Sandra Edwards, Newcastle University, United Kingdom; Christoph Winckler, University of Natural Resources and Life Sciences Vienna, Austria</p></fn>
<corresp id="c001">&#x0002A;Correspondence: Charlotte C. Burn <email>cburn&#x00040;rvc.ac.uk</email></corresp>
<fn fn-type="other" id="fn001"><p>This article was submitted to Animal Welfare and Policy, a section of the journal Frontiers in Animal Science</p></fn></author-notes>
<pub-date pub-type="epub">
<day>17</day>
<month>11</month>
<year>2021</year>
</pub-date>
<pub-date pub-type="collection">
<year>2021</year>
</pub-date>
<volume>2</volume>
<elocation-id>703380</elocation-id>
<history>
<date date-type="received">
<day>30</day>
<month>04</month>
<year>2021</year>
</date>
<date date-type="accepted">
<day>11</day>
<month>10</month>
<year>2021</year>
</date>
</history>
<permissions>
<copyright-statement>Copyright &#x000A9; 2021 Collins, Burn, Wathes, Cardwell, Chang and Bell.</copyright-statement>
<copyright-year>2021</copyright-year>
<copyright-holder>Collins, Burn, Wathes, Cardwell, Chang and Bell</copyright-holder>
<license xlink:href="http://creativecommons.org/licenses/by/4.0/"><p>This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.</p></license>
</permissions>
<abstract><p>Animal welfare assessments that measure welfare outcomes, including behavior and health, can be highly valid. However, the time and skill required are major barriers to their use. We explored whether feasibility of welfare outcome assessment for dairy herds may be improved by rationalizing the number of measures included. We compared two approaches: analyzing whether strong pairwise associations between measures existed, enabling the subsequent exclusion of associated measures; and identifying possible summary measures&#x02014;&#x0201C;iceberg indicators&#x0201D;&#x02014;of dairy herd welfare that could predict herd welfare status. A cross-sectional study of dairy herd welfare was undertaken by a single assessor on 51 English farms, in which 96 welfare outcome measures were assessed. All measures showed at least one pairwise association; percentage of lame cows showed the most (33 correlations). However, most correlations were weak&#x02013;moderate, suggesting limited scope for excluding measures from protocols based on pairwise relationships. A composite measure of the largest portion of herd welfare status was then identified <italic>via</italic> Principal Component Analysis (Principal Component 1, accounting for 16.9% of variance), and linear regression revealed that 22 measures correlated with this. Of these 22, agreement statistics indicated that percentage of lame cows and qualitative descriptors of &#x0201C;calmness&#x0201D; and &#x0201C;happiness&#x0201D; best predicted Principal Component 1. However, even these correctly classified only &#x0007E;50% of farms according to which quartile of the Principal Component 1 they occupied. Further research is recommended, but results suggest that welfare assessments incorporating many diverse measures remain necessary to provide sufficient detail about dairy herd welfare.</p></abstract>
<kwd-group>
<kwd>animal welfare</kwd>
<kwd>farm animals</kwd>
<kwd>on-farm welfare assessment</kwd>
<kwd>dairy cattle</kwd>
<kwd>lameness</kwd>
<kwd>qualitative behaviour assessment</kwd>
<kwd>iceberg indicators</kwd>
<kwd>classification methods</kwd>
</kwd-group>
<counts>
<fig-count count="0"/>
<table-count count="5"/>
<equation-count count="0"/>
<ref-count count="93"/>
<page-count count="15"/>
<word-count count="13856"/>
</counts>
</article-meta>
</front>
<body>
<sec sec-type="intro" id="s1">
<title>Introduction</title>
<p>Welfare outcome (animal-based) measures are arguably the most valid indicators of animal welfare (Rushen and Passill&#x000E9;, <xref ref-type="bibr" rid="B66">1992</xref>; Knierim and Winckler, <xref ref-type="bibr" rid="B48">2009</xref>), so their inclusion in welfare assessment protocols is widely recommended (Capdeville and Veissier, <xref ref-type="bibr" rid="B18">2001</xref>; Waiblinger et al., <xref ref-type="bibr" rid="B80">2001</xref>; Webster et al., <xref ref-type="bibr" rid="B83">2004</xref>; FAWC, <xref ref-type="bibr" rid="B35">2005</xref>). However, welfare outcome assessment is often extremely time-consuming (Rushen and Passill&#x000E9;, <xref ref-type="bibr" rid="B66">1992</xref>). This is partly because it generally needs to be undertaken across multiple animals (Ito et al., <xref ref-type="bibr" rid="B47">2009</xref>; Mullan et al., <xref ref-type="bibr" rid="B55">2009a</xref>; Endres et al., <xref ref-type="bibr" rid="B33">2014</xref>) and/or multiple time points (Ito et al., <xref ref-type="bibr" rid="B47">2009</xref>; Vasseur et al., <xref ref-type="bibr" rid="B78">2012</xref>) to ensure sufficient reliability. For example, although it only takes &#x0007E;1 min to assess avoidance distance in an individual dairy cow using the Welfare Quality<sup>&#x000AE;</sup> protocol, it can take up to 70 min (depending on herd size) to assess this at the herd level (Welfare Quality, <xref ref-type="bibr" rid="B84">2009</xref>). Also, as animal welfare is multi-dimensional (Fraser et al., <xref ref-type="bibr" rid="B39">1997</xref>; Botreau et al., <xref ref-type="bibr" rid="B10">2007a</xref>) and there is no perfect welfare indicator (Mason and Mendl, <xref ref-type="bibr" rid="B53">1993</xref>), welfare assessments need to be based on multiple measures (Blokhuis et al., <xref ref-type="bibr" rid="B8">2010</xref>; Nicol et al., <xref ref-type="bibr" rid="B58">2011</xref>). Although it is sometimes possible to assess multiple welfare outcome measures simultaneously [e.g., aspects of lying behavior, social behavior, and coughing can all be recorded within a single observation period (Welfare Quality, <xref ref-type="bibr" rid="B84">2009</xref>)], the inclusion of multiple measures generally greatly increases overall assessment time. Full welfare outcome assessments can therefore take many hours to complete for a single herd (Welfare Quality, <xref ref-type="bibr" rid="B84">2009</xref>).</p>
<p>The substantial implementation time of welfare outcome assessments is a major barrier to their widespread use on-farm (Knierim and Winckler, <xref ref-type="bibr" rid="B48">2009</xref>; Sandgren et al., <xref ref-type="bibr" rid="B68">2009</xref>; Blokhuis et al., <xref ref-type="bibr" rid="B8">2010</xref>; Nyman et al., <xref ref-type="bibr" rid="B59">2011</xref>; de Vries et al., <xref ref-type="bibr" rid="B30">2013b</xref>). However, it is important they are implemented because of their enhanced validity compared with quicker, resource-based, assessments. Therefore, it is vital we work to develop solutions to improve their feasibility for legal inspections, welfare assurance schemes, and other assessments by farmers and veterinarians.</p>
<p>There are three main routes by which the feasibility of welfare outcome assessments could be improved, without compromising the overall reliability or validity of the assessments:</p>
<list list-type="simple">
<list-item><p>1) automating assessment activities (e.g., Rushen et al., <xref ref-type="bibr" rid="B65">2012</xref>; Berckmans, <xref ref-type="bibr" rid="B7">2014</xref>);</p></list-item>
<list-item><p>2) optimizing sampling strategies within farms, such as focal animal sample sizes and/or the length/frequency of observation periods to achieve an optimal balance between assessment feasibility and reliability (e.g., Ito et al., <xref ref-type="bibr" rid="B47">2009</xref>; Main et al., <xref ref-type="bibr" rid="B51">2010</xref>; Heath et al., <xref ref-type="bibr" rid="B45">2015</xref>; Van Os et al., <xref ref-type="bibr" rid="B75">2018</xref>);</p></list-item>
<list-item><p>3) optimizing the number of measures included in assessment protocols through the use of &#x0201C;summary measures&#x0201D; that can predict other or wider aspects of welfare, to achieve an optimal balance between assessment feasibility and validity (e.g., M&#x000FC;lleder et al., <xref ref-type="bibr" rid="B57">2007</xref>; Nicol et al., <xref ref-type="bibr" rid="B58">2011</xref>; Nyman et al., <xref ref-type="bibr" rid="B59">2011</xref>).</p></list-item>
</list>
<p>In this paper, we focus on the latter route.</p>
<p>There is a small but growing body of research into the optimisation of the number of measures included in assessment protocols. Some studies have investigated relationships between individual welfare outcome measures to highlight areas of potential overlap within assessment protocols, identifying apparently redundant welfare outcome measures for exclusion (e.g., M&#x000FC;lleder et al., <xref ref-type="bibr" rid="B57">2007</xref>; Nicol et al., <xref ref-type="bibr" rid="B58">2011</xref>). Other studies have investigated the existence of putative iceberg indicators which can be used, on their own, to describe the wider welfare status of farms (e.g., Sandgren et al., <xref ref-type="bibr" rid="B68">2009</xref>; Nyman et al., <xref ref-type="bibr" rid="B59">2011</xref>). Iceberg indicators are termed as such because they &#x0201C;provide an overall assessment of welfare, just as the tip of an iceberg signals its submerged bulk beneath the water&#x00027;s surface&#x0201D; (FAWC, <xref ref-type="bibr" rid="B36">2009</xref>).</p>
<p>Associations between different welfare outcomes should exist to some extent, because of causal relationships between them, such as an injury (one measure) causing cows to become lame (a second measure); because of shared underlying risk factors, such as poor housing resulting in dirtiness, injuries and lameness; or because different measures are supposed to be measuring the same thing (animal welfare). Indeed, associations between welfare outcome measures are commonly noted (e.g., Roche et al., <xref ref-type="bibr" rid="B64">2009</xref>; Weary et al., <xref ref-type="bibr" rid="B82">2009</xref>; de Vries et al., <xref ref-type="bibr" rid="B26">2011</xref>). For example, reduced time lying down (Chapinal et al., <xref ref-type="bibr" rid="B19">2009</xref>; Proudfoot et al., <xref ref-type="bibr" rid="B61">2010</xref>) and poor body condition (Green et al., <xref ref-type="bibr" rid="B40">2014</xref>; Randall et al., <xref ref-type="bibr" rid="B62">2015</xref>) are risk factors for lameness across individual cows.</p>
<p>The potential of this approach for improving welfare outcome assessment feasibility has been considered in several species including cattle (M&#x000FC;lleder et al., <xref ref-type="bibr" rid="B57">2007</xref>; de Vries et al., <xref ref-type="bibr" rid="B27">2013a</xref>,<xref ref-type="bibr" rid="B30">b</xref>, <xref ref-type="bibr" rid="B28">2014</xref>), pigs (Mullan et al., <xref ref-type="bibr" rid="B56">2009b</xref>), and chickens (Nicol et al., <xref ref-type="bibr" rid="B58">2011</xref>). These studies often did find significant pairwise associations between the measures investigated, but associations were generally weak (i.e., correlation coefficients of &#x0003C;0.4), leading authors to conclude there was little scope for using one measure to substitute for another (M&#x000FC;lleder et al., <xref ref-type="bibr" rid="B57">2007</xref>; Mullan et al., <xref ref-type="bibr" rid="B56">2009b</xref>; Nicol et al., <xref ref-type="bibr" rid="B58">2011</xref>; de Vries et al., <xref ref-type="bibr" rid="B27">2013a</xref>). However, a more encouraging result was reported by de Vries et al. (<xref ref-type="bibr" rid="B28">2014</xref>), who investigated the extent to which records-based welfare measures could predict directly observed welfare outcome measures; although the predictive ability of individual records-based measures was again poor, predictive performance substantially increased when these measures were combined into small subsets. Also, in broiler chickens, one study found considerable scope for using individual slaughter plant assessments of hockburn and footpad dermatitis to replace certain on-farm measures, which they strongly predicted, potentially reducing welfare assessment time by up to 3 h (de Jong et al., <xref ref-type="bibr" rid="B25">2015</xref>).</p>
<p>The possible existence of iceberg indicators of welfare has received relatively little theoretical consideration to date. FAWC (<xref ref-type="bibr" rid="B36">2009</xref>) postulated that some welfare outcomes may be particularly effective at summarizing overall husbandry quality and animal welfare. The idea that single measures can provide a broad assessment of animal welfare is debated, however, because of the supposed multi-dimensional nature of animal welfare. For example, an effective iceberg indicator might need to capture the extent of pain, fear, hunger, disease, contentment, and more (Dawkins, <xref ref-type="bibr" rid="B24">2006</xref>; Botreau et al., <xref ref-type="bibr" rid="B10">2007a</xref>).</p>
<p>A few studies have attempted to improve welfare assessment feasibility by identifying potential iceberg indicators of welfare, with mixed success. Some of these studies investigated whether any easily available welfare input (de Vries et al., <xref ref-type="bibr" rid="B29">2016</xref>) or welfare outcome (Sandgren et al., <xref ref-type="bibr" rid="B68">2009</xref>; Nyman et al., <xref ref-type="bibr" rid="B59">2011</xref>; Brouwer et al., <xref ref-type="bibr" rid="B15">2015</xref>; Krug et al., <xref ref-type="bibr" rid="B49">2015</xref>) measures held within national dairy databases could be used as screening tools to predict the welfare status of herds, as determined by on-farm welfare assessment. Most found that small subsets of records-based welfare outcome measures&#x02014;related primarily to herd mortality, fertility and somatic cell counts&#x02014;predicted herd welfare status with a moderate-high degree of accuracy, sensitivity and specificity. Thus, the authors concluded that there appeared to be scope for using records-based welfare outcome measures as a highly feasible means of estimating dairy herd welfare. Similarly, a study in pigs concluded there was good potential for using abattoir data as a feasible means of indicating wider welfare on pig farms (van Staaveren et al., <xref ref-type="bibr" rid="B77">2017</xref>).</p>
<p>Other studies investigated whether particular aspects of the Welfare Quality<sup>&#x000AE;</sup> assessment protocol for dairy cows could predict the assessment&#x00027;s overall classification result, i.e., whether a farm was categorized as, for example, &#x0201C;acceptable&#x0201D; or &#x0201C;enhanced&#x0201D; (Andreasen et al., <xref ref-type="bibr" rid="B3">2013</xref>; Heath et al., <xref ref-type="bibr" rid="B44">2014b</xref>). Specifically, Andreasen et al. (<xref ref-type="bibr" rid="B3">2013</xref>) investigated whether the Qualitative Behavior Assessment (QBA) component of the Welfare Quality<sup>&#x000AE;</sup> protocol could predict the assessment&#x00027;s overall classification result, because it is intended to capture the animals&#x00027; expressions of their own subjective experiences (e.g., Wemelsfelder et al., <xref ref-type="bibr" rid="B87">2001</xref>; Wemelsfelder, <xref ref-type="bibr" rid="B85">2007</xref>). However, they found no significant correlation between QBA and the overall classification result. In contrast, Heath et al. (<xref ref-type="bibr" rid="B43">2014a</xref>) found that&#x02014;when analyzed within a diagnostic agreement framework&#x02014;the QBA component of the Welfare Quality<sup>&#x000AE;</sup> protocol was reasonably good at predicting the overall classification result (67% predictive accuracy). That study investigated the extent to which many different components of the Welfare Quality<sup>&#x000AE;</sup> protocol, including welfare inputs, could predict the overall classification result. Unexpectedly, the best performing component was the &#x0201C;absence of thirst&#x0201D; criterion, which comprises several welfare input measures related to water provision. The authors argue that, rather than water provision being highly informative/integrative <italic>per se</italic>, this result likely reflects previously identified problems with the Welfare Quality<sup>&#x000AE;</sup> multi-criteria aggregation method used to generate the overall classification result (Heath et al., <xref ref-type="bibr" rid="B43">2014a</xref>). This is because the &#x0201C;absence of thirst&#x0201D; criterion was unintentionally weighted more highly within the overall aggregation process compared with many of the other criteria within the assessment protocol (de Vries et al., <xref ref-type="bibr" rid="B27">2013a</xref>), creating an especially strong relationship between this criterion and the overall classification (Heath et al., <xref ref-type="bibr" rid="B43">2014a</xref>). A similar issue was described when using the Welfare Quality<sup>&#x000AE;</sup> protocol for broiler chickens, where the final classification was heavily influenced only by &#x0201C;drinker space&#x0201D; and &#x0201C;stocking density,&#x0201D; and the classification was extremely insensitive to changes in other constituent welfare outcome measures (Buijs et al., <xref ref-type="bibr" rid="B16">2016</xref>). It is thus difficult to draw firm conclusions about the possible existence of iceberg indicators of welfare based on these latter studies, but clearly it will be important to develop a welfare classification system that does not give undue weight to constituent measures that are unlikely to be key determinants or signals of overall animal welfare.</p>
<p>In this study we aimed to evaluate whether the feasibility of welfare outcome assessment for dairy herds could be improved by rationalizing the number of measures used. The objectives were to identify pairwise correlations between measures of welfare and to identify putative iceberg indicators of welfare, <italic>via</italic> a cross-sectional study incorporating a comprehensive welfare outcome assessment.</p>
<p>Our hypotheses were that, if some welfare indicators are highly predictive of overall welfare, then firstly pairwise associations will exist between different individual welfare outcome measures for UK dairy herds; and secondly measures of the overall welfare statuses of UK dairy herds can be predicted by a few specific individual welfare outcome measures (iceberg indicators).</p>
<p>The two hypotheses are complementary, but it was necessary to test both because, whilst use of iceberg indicators would be the more efficient approach, it was uncertain that iceberg indicators would even exist. If none existed, then pairwise analyses could at least indicate if any individual measures were redundant (and thus could be excluded) because of a very strong correlation with another measure.</p>
</sec>
<sec sec-type="materials and methods" id="s2">
<title>Materials and Methods</title>
<sec>
<title>Selection and Recruitment of Farms</title>
<p>Farms were recruited from a database of 468 British dairy farms that had participated in the AHDB Dairy (formally DairyCo) Milkbench&#x0002B; Profitability Benchmarking Scheme in 2012 [described in DairyCo (<xref ref-type="bibr" rid="B23">2014</xref>)]. We used a purposive stratified sampling approach to select farms of differing system types and potential welfare statuses into the study. To do this, relevant Milkbench&#x0002B; variables (e.g., &#x0201C;amount of non-forage feed fed/cow/year&#x0201D; and &#x0201C;% cows culled&#x0201D;) were submitted to an exploratory principal component analysis (PCA), to help identify composite measures to give an approximation relevant to system type and welfare status. This revealed two distinct principal components, which did not map entirely onto system type or welfare status, but that appeared to describe the overall &#x0201C;production intensity&#x0201D; (e.g., average milk yield/cow/year, amount of non-forage feed fed/cow/year) and general &#x0201C;mortality/morbidity status&#x0201D; (e.g., % cows culled, average milk SCC/year) of farms, respectively. We used the &#x0201C;production intensity&#x0201D; and &#x0201C;mortality/morbidity&#x0201D; principal component scale quartile values to stratify the 468 Milkbench&#x0002B; farms into a total of 16 farm system type/herd welfare status categories. The stratification process resulted in categories spanning (a) lower input/output farms with poorer welfare (higher mortality/morbidity), (b) lower input/output farms with better welfare through to (c) higher input/output farms with poorer welfare and (d) higher input/output farms with better welfare. A similar sampling approach has been used in a number of previous studies to actively recruit farms of a range of different system types (Haskell et al., <xref ref-type="bibr" rid="B42">2006</xref>), herd sizes (Nyman et al., <xref ref-type="bibr" rid="B59">2011</xref>), and herd welfare statuses (de Vries et al., <xref ref-type="bibr" rid="B27">2013a</xref>).</p>
<p>For logistical reasons it was only possible to visit farms in the South/Midlands of England, which comprised 242 of the original 468 farms. All of the 16 farm system type/herd welfare status categories were still well-represented across the 242 farms. Farms were then selected for telephone recruitment at random from within each of the 16 categories in a sequential fashion, to ensure that approximately equal numbers of farms were recruited from within the 16 categories. Farms that accepted the invitation to participate during the recruitment telephone call were recruited providing they met the following criteria: (i) intention to participate in the Milkbench&#x0002B; Profitability Benchmarking Scheme in 2013 (ensuring farm profitability data for 2013 for a related study); (ii) participation in milk recording at least every 6 weeks (ensuring availability of detailed herd milk production, milk quality, and fertility data); and (iii) use of separate housing for milking cows and dry cows/pre-calving heifers (the on-farm welfare assessment protocol focused on milking cows only and this ensured that non-milking animals were not accidentally scored).</p>
<p>Incentives to encourage participation in the study comprised on-farm feedback of mobility scoring results, farm performance benchmarking with respect to a number of key welfare outcome measures, and an overall summary report of the project findings. Also, participants were assured that the farm visit would not impact on the daily routine of the farm.</p>
</sec>
<sec>
<title>Data Collection</title>
<sec>
<title>Overview</title>
<p>All farm visits were conducted by the same assessor (SC) between mid-September 2013 and mid-April 2014. Where possible, visits coincided with the farms&#x00027; winter housing period, and each visit was conducted over 2 consecutive days. The visits comprised two main phases: an on-farm welfare outcome assessment of the milking herd, and a farmer interview. The on-farm welfare assessment took &#x0007E;12&#x02013;14 h to complete across the 2 days, depending on herd size. The interview was then undertaken on Day 2, with the member of farm staff responsible for herd health management (the &#x0201C;farmer&#x0201D;), taking &#x0007E;60 min. The farmer was asked about the farm&#x00027;s record-keeping, and the assessor took photographic or electronic copies of relevant and available herd health/welfare records for subsequent review. The farmer was also asked to read the study information sheet and sign the associated study consent form, as well as additional consent forms enabling subsequent access to the farm&#x00027;s milking recording data and British Cattle Movement Service records. The methods were approved by the Royal Veterinary College (RVC)&#x00027;s ethics committee and data were held securely in line with the RVC&#x00027;s guidelines on data confidentiality and protection.</p>
</sec>
<sec>
<title>Developing the Welfare Outcome Assessment Protocol</title>
<p>Welfare outcome measures were selected and considered for inclusion using four sources. Firstly, we conducted a review of existing assessment protocols for dairy cows developed either by animal welfare scientists and/or industry (e.g., Capdeville and Veissier, <xref ref-type="bibr" rid="B18">2001</xref>; Waiblinger et al., <xref ref-type="bibr" rid="B80">2001</xref>; Whay et al., <xref ref-type="bibr" rid="B88">2003a</xref>; Welfare Quality, <xref ref-type="bibr" rid="B84">2009</xref>; AssureWel, <xref ref-type="bibr" rid="B4">2015</xref>). Secondly, to develop and supplement the list of measures we conducted a consultation of expert opinion involving members of both the AHDB Dairy &#x0201C;Health, Welfare &#x00026; Nutrition&#x0201D; Research Partnership Work Package on welfare assessment, and the RVC Farm Animal Health and Production Group, to gather opinions on key welfare outcome measures to include in the protocol. Thirdly, we conducted UK dairy farmer and cattle vet focus groups and questionnaire survey (Collins, <xref ref-type="bibr" rid="B21">2016a</xref>) to identify the participants&#x00027; preferences for different welfare outcome measures and their opinions on potential iceberg indicators of dairy cow welfare; and lastly, we conducted on-farm assessment trial sessions at the RVC farm, and a formal pilot study on four dairy farms (selected as a convenience sample) in March 2013.</p>
<p>The individual welfare outcome measures included in the protocol needed to be valid, reliable and feasible (Winckler et al., <xref ref-type="bibr" rid="B93">2003</xref>), although this is yet to be established for many commonly used welfare measures (Knierim and Winckler, <xref ref-type="bibr" rid="B48">2009</xref>). Priority was given to already standardized welfare outcome measures [e.g., those included in Welfare Quality (<xref ref-type="bibr" rid="B84">2009</xref>)] for which these criteria had already been evaluated (e.g., Forkman and Keeling, <xref ref-type="bibr" rid="B38">2009</xref>; Knierim and Winckler, <xref ref-type="bibr" rid="B48">2009</xref>). When selecting measures for which multiple standardized versions were available, preference was given to UK dairy industry recommended measures [e.g., the AHDB Dairy Mobility Score and AssureWel cleanliness, abrasions and swellings scores were selected over equivalent Welfare Quality<sup>&#x000AE;</sup> measures (AHDB Dairy, <xref ref-type="bibr" rid="B2">2015b</xref>; AssureWel, <xref ref-type="bibr" rid="B4">2015</xref>)]. This was so that the assessment results could be meaningfully compared with existing UK studies, and the data generated could be easily interpreted by participating farmers.</p>
</sec>
<sec>
<title>Welfare Outcome Measures</title>
<p>The final welfare outcome assessment protocol featured measures related to different aspects of dairy cow production, health, physical condition and behavior. Welfare outcome measures were assessed at the cow-, cow group- or herd-level using lactating cow groups, and then summarized at the herd-level. <xref ref-type="supplementary-material" rid="SM1">Supplementary Table 1</xref> provides an overview of the structure and content of the protocol. Full details of the structure and content of the protocol&#x02014;including exact assessment procedures, descriptions of case definitions (e.g., &#x0201C;lame,&#x0201D; &#x0201C;dirty,&#x0201D; &#x0201C;aggressive head-butt,&#x0201D; etc.) and detailed procedures for summarizing the data collected at the herd-level&#x02014;are provided in <xref ref-type="supplementary-material" rid="SM2">Supplementary Table 2</xref>.</p>
<p>Unfortunately, the prevalence and/or incidence of the &#x0201C;health event&#x0201D; welfare outcomes referred to in <xref ref-type="supplementary-material" rid="SM1">Supplementary Tables 1</xref>, <xref ref-type="supplementary-material" rid="SM2">2</xref> (i.e., mastitis, lameness, dystocia, milk fever, retained fetal membranes, metritis/endometritis, and displaced abomasums) could not ultimately be calculated. This was because most farm records were found to be of insufficient quality or quantity to provide suitably robust data for analysis, and so these measures could not be included in our final welfare outcome dataset.</p>
</sec>
<sec>
<title>Intra-Observer Reliability of Welfare Outcome Scoring</title>
<p>To help to ensure a good level of intra-observer reliability, the assessor underwent official training to measure the welfare outcomes included, where this was available (e.g., for the Welfare Quality<sup>&#x000AE;</sup> measures and the AHDB Dairy Mobility Score). Additionally, the assessor practiced data collection during the pilot studies, and the intra-observer reliability of the assessor&#x00027;s scoring was then formally assessed.</p>
<p>To develop suitable intra-observer reliability tests, relevant photographs and/or video footage of cows were collected during the assessment trial sessions, pilot study, and first few farm visits. Tests were successfully developed for individual qualitative descriptors (QDs), time taken to lie down, collisions during lying down, the continuous behavior sampling measures, mobility, body condition, cleanliness, abrasions, swellings, ocular discharge, nasal discharge, vulval discharge, diarrhea, injured tails, and cows lying incorrectly. Hampered respiration, chase-ups of lying cows, fighting bouts, or chasing bouts could not be included because they were too infrequent to capture on film. The response to assessor was also excluded because it was difficult to replicate using photographs/video footage. The developed agreement tests were undertaken at the beginning, middle, and end of the farm visit period and the results obtained at the three different time points were statistically compared.</p>
</sec>
</sec>
<sec>
<title>Statistical Analysis</title>
<sec>
<title>Investigating the Pairwise Relationships Between the Different Welfare Outcome Measures</title>
<p>All statistical analyses were completed using IBM SPSS Statistics v.22 and a type I error rate of 0.05 was used in all statistical tests. Pairwise analysis was important for identifying the degrees to which outcomes were correlated in an initial exploratory analysis, related to finding outcomes that were highly predictive of other outcomes. Pairwise relationships between continuous welfare outcome measures were investigated using correlations. Pearson&#x00027;s correlation tests were used when both measures were normally distributed. Where data were not normally distributed, natural logarithm or square root transformations were applied in an attempt to achieve normal distribution. Negatively skewed data were reversed prior to this. Variables with excessive zeros could not be transformed to achieve normal distribution.</p>
<p>Spearman&#x00027;s rank correlations were used when one or both of the measures could not be transformed to achieve normal distribution. The relationship between the various continuous welfare outcome measures and response to assessor (the only categorical welfare outcome measure in the protocol) was investigated using logistic regression. No correction was made to the p values to adjust for multiple testing, due to the exploratory nature of these various pairwise analyses (Bender and Lange, <xref ref-type="bibr" rid="B6">2001</xref>).</p>
</sec>
<sec>
<title>Determining Herd Overall Welfare Status</title>
<p>To investigate whether any individual welfare outcome measures could predict the overall welfare status of herds, it was first necessary to develop a method for determining as closely as possible the herds&#x00027; overall welfare status. Instead of condensing measures using specific &#x0201C;aggregation rules&#x0201D; informed mainly by expert opinion as in previous studies (e.g., Bracke et al., <xref ref-type="bibr" rid="B14">2002</xref>; Botreau et al., <xref ref-type="bibr" rid="B12">2008</xref>, <xref ref-type="bibr" rid="B13">2009</xref>; Calamari and Bertoni, <xref ref-type="bibr" rid="B17">2009</xref>), we attempted to aggregate measures into a composite overall welfare scale on the basis of their observed inter-relationships using PCA.</p>
<p>As PCA cannot be undertaken on variables with a lot of missing data or with low variance, such measures were excluded. These were QD distressed, frustrated and bored, mean number of chase ups, chasing bouts and fighting bouts per cow/hour, mean time to lie down, % collisions during lying down, all of the automatically recorded lying behavior measures, % cows dull and depressed, all of the substantial swelling measures except % cows with substantial swelling on the hind leg, % cows with lesions on the udder, % cows with diarrhea, % cows with hampered respiration, all of the milk recording measures, and all of the mortality measures. Also, due to multi-collinearity, in any identified &#x02265;0.9 pairwise correlation the variable with the smallest number of correlations with other measures was excluded. This was particularly important for outcomes comprising several similar welfare measures, such as several alternative measures of cleanliness. Thus, all closely related alternative measures were removed before conducting the PCA (Field, <xref ref-type="bibr" rid="B37">2013</xref>).</p>
<p>Principal components with eigenvalues of &#x02265;1 were reviewed and interpreted on the basis of the various measures&#x00027; factor loadings. Factor loadings of &#x02265;0.4 were used as a threshold to indicate a meaningful association. In line with similar existing studies (e.g., Veissier et al., <xref ref-type="bibr" rid="B79">2004</xref>; Van Reenen et al., <xref ref-type="bibr" rid="B76">2005</xref>) the first principal component, which accounts for the most variance within a given dataset, was taken forward as our measure of the composite welfare scale (being the largest single aggregate measure of the originally submitted variables).</p>
<p>To avoid the circularity of investigating relationships between each individual measure and a composite scale within which it was nested (Heath et al., <xref ref-type="bibr" rid="B43">2014a</xref>), multiple composite welfare scales were generated using the PCA method described, each time excluding the welfare outcome measure to be tested against it. In instances where there were multiple versions of the same measure (e.g., dirty and very dirty, or lame and very lame) all versions were excluded. This allowed us to test the extent to which each individual measure could predict the composite welfare scale as summarized by all other variables (e.g., &#x0201C;does the percentage of lame cows correlate with the composite welfare scale when the percentage of lame cows has been excluded from that composite scale?&#x0201D;).</p>
<p>Intra-class correlation coefficients were used to test the level of agreement between the various newly generated composite welfare scales and the original composite welfare scale to investigate any likely reduction in validity resulting from the systematic exclusion approach. Correlation between these scales was found to be statistically significant and very high (correlation coefficient &#x0003E;0.9) in every case (<italic>p</italic> &#x0003C; 0.001). Therefore, we proceeded to test individual variables against their own complementary composite PCA scales as proxy measures of herd overall welfare status.</p>
</sec>
<sec>
<title>Identifying Iceberg Indicators of Dairy Herd Welfare</title>
<p>Linear regression analysis was used to investigate whether any of the individual welfare outcome measures could predict herd overall welfare status (i.e., the measures&#x00027; respective composite welfare scale). Measures excluded from the original PCA (e.g., due to missing data or multicollinearity) were compared with the original composite welfare scale. In addition, a separate PCA was undertaken on the 20 QD terms included in the protocol which produced (three) summary measures of the QBA (labeled, on the basis of factor loadings, herd &#x0201C;contentedness,&#x0201D; &#x0201C;agitation,&#x0201D; and &#x0201C;sociability&#x0201D;). Relationships between each of these three QBA principal components and their respective complementary composite welfare scales were also investigated using separate linear regressions. This additional PCA was conducted because QDs are not advised to be used independently of each other (e.g., Welfare Quality, <xref ref-type="bibr" rid="B84">2009</xref>), whereas we also needed to test the descriptors independently in this exploratory study, because of our aim being to investigate whether any of the measures, including any QDs, could be removed.</p>
<p>Finally, all individual measures that were significantly associated with their complementary composite welfare scale were taken forward to a second stage of analysis of herd welfare status categories. This was necessary because, in applied contexts such as welfare assurance labeling schemes for consumers, welfare is summarized as categories [e.g., poor, acceptable, enhanced, or excellent (Welfare Quality, <xref ref-type="bibr" rid="B84">2009</xref>)], rather than on a continuum (Webster et al., <xref ref-type="bibr" rid="B83">2004</xref>; Honey, <xref ref-type="bibr" rid="B46">2013</xref>). This second stage therefore investigated the ability of the individual welfare outcome measures to predict herd welfare categories that were created from the composite welfare scales. To do this, farms were categorized into quartiles on the composite welfare scales, creating four potential categories of overall &#x0201C;welfare status.&#x0201D; Mirroring this, farms were also categorized into quartiles for each significant individual welfare outcome measure. Agreement between the quartile allocations of each individual measure <italic>versus</italic> quartiles of the complementary welfare scale was assessed using predictive accuracy (% farms correctly classified), Cohen&#x00027;s Kappa statistic and Kendall&#x00027;s coefficient of concordance. This allowed us to test the extent to which farm categories that were created using each individual measure would match the farm categories that were created using the complementary welfare scales. Perfect agreement would indicate that the allocation of farms into quartiles according to an individual measure exactly matched the quartile allocation for the complementary welfare scale. Once again, due to the exploratory nature of the regression and agreement analyses used to explore the iceberg indicator question, no correction was made to the p values to adjust for multiplicity (Bender and Lange, <xref ref-type="bibr" rid="B6">2001</xref>).</p>
</sec>
</sec>
</sec>
<sec sec-type="results" id="s3">
<title>Results</title>
<sec>
<title>Description of Farm Sample</title>
<p>In total 52 farms (each with a single herd) were recruited into the study. This was the number of farms practically possible to visit within the study period. One farm withdrew its participation before its visit and, therefore, the cross-sectional study was undertaken on a total of 51 farms. The median milking herd size of the farms was 180 cows (IQR = 84; min. = 57; max. = 1,545). Median days in milk ranged between 25 and 252 across the different herds (median = 179; IQR = 60), and the mean 305 day milk yield/cow was 8,290.9 L (SD = 1,622.7; min. = 4,742.3; max. = 11,608.1). Median milking cow parity was 2 (IQR = 1&#x02013;3). Descriptive statistics for key categorical farm management variables for the final 51 farms assessed are displayed in <xref ref-type="table" rid="T1">Table 1</xref>. Most farms had Holstein Friesian or Holstein cows. Most farms had all-year-round calving, and cubicle housing systems, and most milked twice daily using a non-robotic system. The median number of days at grass during 2013 was 193.5 days (IQR = 82.3; min. = 0.0; max. = 294.0).</p>
<table-wrap position="float" id="T1">
<label>Table 1</label>
<caption><p>Descriptive statistics for the categorical farm management variables for the 51 cross-sectional study farms.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Variable</bold></th>
<th valign="top" align="left"><bold>Category</bold></th>
<th valign="top" align="center"><bold>% of farms</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Predominant cow breed</td>
<td valign="top" align="left">Holstein Friesian</td>
<td valign="top" align="center">41.2</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Holstein</td>
<td valign="top" align="center">33.3</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Friesian</td>
<td valign="top" align="center">9.8</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Cross bred or mixed breeds</td>
<td valign="top" align="center">9.8</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Jersey</td>
<td valign="top" align="center">3.9</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Brown Swiss</td>
<td valign="top" align="center">2.0</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Missing</td>
<td valign="top" align="center">0.0</td>
</tr>
<tr>
<td valign="top" align="left">Milking cow housing<sup>&#x02020;</sup></td>
<td valign="top" align="left">Cubicles (24 h/day)</td>
<td valign="top" align="center">56.9</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Mixed <xref ref-type="table-fn" rid="TN1"><sup>&#x02021;</sup></xref></td>
<td valign="top" align="center">17.8</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Cubicles and straw yards (24 h/day)</td>
<td valign="top" align="center">13.7</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Straw yard (24 h/day)</td>
<td valign="top" align="center">7.8</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Pasture (24 h/day)</td>
<td valign="top" align="center">3.9</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Missing</td>
<td valign="top" align="center">0.0</td>
</tr>
<tr>
<td valign="top" align="left">Calving pattern</td>
<td valign="top" align="left">All-year-round calving</td>
<td valign="top" align="center">54.9</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Multi-block calving</td>
<td valign="top" align="center">19.6</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Autumn calving</td>
<td valign="top" align="center">13.7</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Spring calving</td>
<td valign="top" align="center">7.8</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Other</td>
<td valign="top" align="center">3.9</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Missing</td>
<td valign="top" align="center">0.0</td>
</tr>
<tr>
<td valign="top" align="left">Milking system</td>
<td valign="top" align="left">Non-robotic</td>
<td valign="top" align="center">88.2</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Robotic</td>
<td valign="top" align="center">11.8</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Missing</td>
<td valign="top" align="center">0.0</td>
</tr>
<tr>
<td valign="top" align="left">Milking frequency</td>
<td valign="top" align="left">2 &#x000D7; day</td>
<td valign="top" align="center">91.1</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">3 &#x000D7; day</td>
<td valign="top" align="center">8.9</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Missing</td>
<td valign="top" align="center">0.0</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Data are arranged in descending order of prevalence for each variable</italic>.</p>
<p><italic>At the time of the farm visit (excluding cow groups representing &#x0003C;10% of herd)</italic>.</p>
<fn id="TN1">
<label>&#x02021;</label>
<p><italic>Either some cow groups were at pasture/housed or cow groups were at pasture during the day and housed during the night</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Welfare Outcome Measure Descriptive Statistics</title>
<p>Descriptive statistics for the final 96 behavior-based, health and physical condition-based and records-based welfare outcome measures are displayed in <xref ref-type="supplementary-material" rid="SM3">Supplementary Table 3</xref>. The QDs receiving the highest visual analog scale (VAS) scores across farms were &#x0201C;relaxed,&#x0201D; &#x0201C;calm,&#x0201D; &#x0201C;positively occupied,&#x0201D; and &#x0201C;content.&#x0201D; The median herd mean for percentage cows feeding was 33.0%, and of ruminating was 27.4%. The median herd mean of cows lying down was 41.9%, and they lay down for a median herd mean of 10.5 h/day; only 0.1% lay incorrectly, but 23.9% of those observed lying down collided with housing equipment. A median of 1.2 agonistic episodes were seen per cow/h, whilst 0.2 equivalent episodes of social licking were seen. A median of 22.2% of cows per herd were lame, and 5.7% very lame. A median of 26.5% of cows/herd had nasal discharge, and a median of 18.6% had high somatic cell counts on their most recent test day. The median mortality was 5.3&#x02013;6% cows/herd dying on farm, depending on the year. During the response to assessor test, 72.3% of herds were assessed as &#x0201C;calm/relaxed,&#x0201D; and 27.7% were assessed as &#x0201C;nervous/wary.&#x0201D;</p>
<p>The variation observed in welfare performance across the 51 farms depends on the individual welfare outcome measure in question. For example, the % of cows with swellings on their hind legs varied by 53.1% (0 &#x02013; 53.1%) across the different herds, whereas the cows with swellings on their udders varied by just 4.2% (0 &#x02013; 4.2%). As noted previously some welfare outcomes were rare, with many zeros. For example, at least 75% of the farms received scores of zero for 17 of the 96 welfare outcome measures (QD fearful, frustrated, bored and distressed; mean no. of fighting and chasing bouts per cow/h; % cows with diarrhea, hampered respiration or swelling on their udder; % cows with lesions on their udder, head/neck/shoulders and foreleg; and % cows with substantial swelling on the five body areas investigated).</p>
</sec>
<sec>
<title>Intra-observer Reliability</title>
<p>Intra-observer reliability was very good for all of the categorical welfare outcome measures tested (<xref ref-type="supplementary-material" rid="SM4">Supplementary Tables 4</xref>, <xref ref-type="supplementary-material" rid="SM5">5</xref>). Cohen&#x00027;s Kappa values of &#x0003E;0.60 [indicating &#x0201C;substantial&#x0201D; agreement (Landis and Koch, <xref ref-type="bibr" rid="B50">1977</xref>)], and Kendall&#x00027;s coefficient of concordance values of &#x0003E;0.70 [indicating &#x0201C;strong&#x0201D; agreement (Schmidt, <xref ref-type="bibr" rid="B70">1997</xref>)] were consistently achieved across all three timepoints, for all measures. Intra-observer reliability was also good for the continuous welfare outcome measures tested. Intra class correlation coefficients of &#x0003E;0.40 (indicating &#x0201C;fair&#x0201D; reliability; Cicchetti, <xref ref-type="bibr" rid="B20">1994</xref>) were consistently observed for all measures, with the exception of &#x0201C;QD happy&#x0201D; (which was weaker: coefficient = 0.17&#x02013;0.44). Furthermore, for most comparisons, coefficients of &#x0003E;0.75 (indicating a &#x0201C;good&#x0201D; level of reliability) were achieved. It must be noted, however, that agreement was not always statistically significant for a number of the QD measures; the lack of significance could be because these agreement tests were based on only five observations due to little suitable video footage, whereas the tests for the other measures were based on between 18 and 60 observations.</p>
</sec>
<sec>
<title>Pairwise Associations Between Individual Welfare Outcome Measures</title>
<p>Each of the 95 continuous welfare outcome measures was significantly correlated with at least one other measure. Most significant correlations were at best only &#x0201C;moderate&#x0201D; in strength i.e., 0.4 to 0.7 (Martin and Bateson, <xref ref-type="bibr" rid="B52">2007</xref>). Only 12 correlations were &#x0201C;high&#x0201D; strength (&#x02265;0.7 to &#x0003C;0.9) and only five &#x0201C;very high&#x0201D; strength (&#x02265;0.9), and these generally comprised pairs of measures that captured aspects of the same welfare outcome (e.g., &#x0201C;dirty&#x0201D; and &#x0201C;very dirty&#x0201D; hindquarters; <xref ref-type="table" rid="T2">Table 2</xref>).</p>
<table-wrap position="float" id="T2">
<label>Table 2</label>
<caption><p>Very high and high pairwise correlations detected between the 95 continuous welfare outcome measures.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Correlation strength</bold></th>
<th valign="top" align="left"><bold>Variable A</bold></th>
<th valign="top" align="left"><bold>Variable B</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">Very highly correlated measures (&#x02265;0.9)</td>
<td valign="top" align="left">Median calving interval</td>
<td valign="top" align="left">Median calving to conception interval</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Median age at first calving</td>
<td valign="top" align="left">Median age at second calving</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Mean milk protein at first MR test day postpartum</td>
<td valign="top" align="left">Mean milk protein at second MR test day postpartum</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Mean no. of total agonistic social behaviors/cow/hour</td>
<td valign="top" align="left">Mean no. of gentle head butts/cow/hour</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">QD relaxed</td>
<td valign="top" align="left">QD calm</td>
</tr>
<tr>
<td valign="top" align="left">Highly correlated measures (&#x02265;0.7 to &#x0003C;0.9)</td>
<td valign="top" align="left">% cows with dirty hindquarters</td>
<td valign="top" align="left">% cows with very dirty hindquarters</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">% cows with dirty hind legs</td>
<td valign="top" align="left">% cows with very dirty hind legs</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">% cows with dirty udders</td>
<td valign="top" align="left">% cows with very dirty hindquarters</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Mean % of MR test days with high milk SCC previous 12 months</td>
<td valign="top" align="left">% cows with high milk SCC at the most recent MR test day</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">Mean no. of total agonistic social behaviors/cow/hour</td>
<td valign="top" align="left">Mean no. of displacements/cow/hour</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">QBA PC &#x0201C;contentedness&#x0201D;</td>
<td valign="top" align="left">QD content</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">QBA PC &#x0201C;contentedness&#x0201D;</td>
<td valign="top" align="left">QD happy</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">QBA PC &#x0201C;contentedness&#x0201D;</td>
<td valign="top" align="left">QD relaxed</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">QBA PC &#x0201C;contentedness&#x0201D;</td>
<td valign="top" align="left">QD calm</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">QD calm</td>
<td valign="top" align="left">QD content</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">QD relaxed</td>
<td valign="top" align="left">QD content</td>
</tr>
<tr>
<td/>
<td valign="top" align="left">QD content</td>
<td valign="top" align="left">QD happy</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>All correlations were significant at p &#x0003C; 0.001</italic>.</p>
<p><italic>QBA, qualitative behavior assessment; QD, qualitative descriptor; PC, principal component; MR, milk recording; SCC, somatic cell count</italic>.</p>
<p><italic>Variable A and Variable B are arbitrary labels and could be interchanged between variables within a pair</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p>The percentage of lame cows significantly correlated with the largest number of other measures (33), whereas both &#x0201C;mean number of chases/cow/hour&#x0201D; and &#x0201C;percentage heifer calves died on-farm 2012&#x0201D; significantly correlated with the fewest measures (one each). <xref ref-type="supplementary-material" rid="SM6">Supplementary Table 6</xref> displays the welfare outcome measures that were at least moderately correlated with 10 or more other measures for information.</p>
<p>Finally, there were also significant pairwise relationships between four of the 95 continuous welfare outcome measures and the herds&#x00027; response to assessor, which was recorded as the proportion of cows &#x0201C;calm/relaxed&#x0201D; vs. &#x0201C;nervous/wary&#x0201D; (0 vs. 1, respectively). These were &#x0201C;SD no. of lying bouts/day&#x0201D; (Coeff&#x0002B;/&#x02212;S.E = 1.2&#x0002B;/&#x02212;0.4; <italic>p</italic> = 0.016), &#x0201C;Percentage cows with low protein on first/second MR test day postpartum&#x0201D; (16.3&#x0002B;/&#x02212;6.6; <italic>p</italic> = 0.019), age at first calving (60.7&#x0002B;/&#x02212;20.2; <italic>p</italic> = 0.005), and age at second calving log<sub>10</sub> (0.03&#x0002B;/&#x02212;0.01; <italic>p</italic> = 0.004).</p>
</sec>
<sec>
<title>Determining Herd Overall Welfare Status</title>
<p>The PCA to create the overall welfare outcome scale reduced the 56 welfare outcome measures that could be included into 17 principal components, which together explained 85.3% of the variance in the dataset. Some missing data were tolerated within the dataset, but this meant that principal component scores could only be generated for 41 of the 51 farms. The first principal component (PC 1) explained 16.9% of the variance. <xref ref-type="table" rid="T3">Table 3</xref> summarizes the 23 welfare outcome measures which had factor loadings of &#x0003E;0.4 for PC 1. On the basis of these factor loadings, it can be interpreted that farms with higher positive scores for PC 1 had a poorer welfare status. For example, they received lower QD happy and QD content scores and higher scores for QD apathetic and QD uneasy, and had higher percentages of dirty and lame cows. Overall, given both the breadth and strong welfare relevance of the 23 individual welfare outcomes measures with factor loadings of &#x0003E;0.4 for PC 1, this principal component was deemed a suitable proxy measure of herd welfare status for the purposes of the iceberg indicator analyses. Beyond PC 1, the other principal components were more difficult to interpret and less obviously relevant to welfare (Collins, <xref ref-type="bibr" rid="B22">2016b</xref>). The decrease in their explanatory value upon examining the scree plot was fairly gradual rather than there being a clear step change, meaning there was no obvious &#x0201C;top&#x0201D; set of principal components to consider as the most important. Thus, despite the fairly low percentage of variance that PC 1 explained on its own, it was taken forward as the relevant composite welfare scale against which potential iceberg indicator measures could be tested.</p>
<table-wrap position="float" id="T3">
<label>Table 3</label>
<caption><p>The 23 welfare outcome measures with factor loadings &#x0003E;0.4 for principal component 1.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Welfare outcome measure</bold></th>
<th valign="top" align="center"><bold>Factor loading</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">% lame cows</td>
<td valign="top" align="center">0.72</td>
</tr>
<tr>
<td valign="top" align="left">% cows with very dirty udders</td>
<td valign="top" align="center">0.72</td>
</tr>
<tr>
<td valign="top" align="left">% cows with very dirty hindquarters SQRT</td>
<td valign="top" align="center">0.68</td>
</tr>
<tr>
<td valign="top" align="left">% cows with swelling rest of body</td>
<td valign="top" align="center">0.66</td>
</tr>
<tr>
<td valign="top" align="left">QD relaxed REV log<sup>10</sup></td>
<td valign="top" align="center">0.65</td>
</tr>
<tr>
<td valign="top" align="left">Mean % cows ruminating</td>
<td valign="top" align="center">0.63</td>
</tr>
<tr>
<td valign="top" align="left">QD apathetic</td>
<td valign="top" align="center">0.63</td>
</tr>
<tr>
<td valign="top" align="left">% cows with dirty udders</td>
<td valign="top" align="center">0.60</td>
</tr>
<tr>
<td valign="top" align="left">% cows with dirty hindquarters</td>
<td valign="top" align="center">0.59</td>
</tr>
<tr>
<td valign="top" align="left">QD uneasy</td>
<td valign="top" align="center">0.56</td>
</tr>
<tr>
<td valign="top" align="left">% cows with nasal discharge</td>
<td valign="top" align="center">0.47</td>
</tr>
<tr>
<td valign="top" align="left">% cows with very dirty hind legs</td>
<td valign="top" align="center">0.47</td>
</tr>
<tr>
<td valign="top" align="left">Mean no. of coughs/cow/15 min</td>
<td valign="top" align="center">0.46</td>
</tr>
<tr>
<td valign="top" align="left">QD indifferent</td>
<td valign="top" align="center">0.43</td>
</tr>
<tr>
<td valign="top" align="left">% cows with dirty hind legs</td>
<td valign="top" align="center">0.42</td>
</tr>
<tr>
<td valign="top" align="left">% very lame cows log<sup>10</sup></td>
<td valign="top" align="center">0.41</td>
</tr>
<tr>
<td valign="top" align="left">QD lively</td>
<td valign="top" align="center">&#x02212;0.44</td>
</tr>
<tr>
<td valign="top" align="left">Mean % cows feeding</td>
<td valign="top" align="center">&#x02212;0.47</td>
</tr>
<tr>
<td valign="top" align="left">QD active</td>
<td valign="top" align="center">&#x02212;0.48</td>
</tr>
<tr>
<td valign="top" align="left">QD friendly log<sup>10</sup></td>
<td valign="top" align="center">&#x02212;0.49</td>
</tr>
<tr>
<td valign="top" align="left">QD positively occupied</td>
<td valign="top" align="center">&#x02212;0.69</td>
</tr>
<tr>
<td valign="top" align="left">QD content</td>
<td valign="top" align="center">&#x02212;0.75</td>
</tr>
<tr>
<td valign="top" align="left">QD happy</td>
<td valign="top" align="center">&#x02212;0.81</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Welfare outcome measures are arranged in descending order of loading onto principal component 1. Measures with factor loadings below 0.4 are not shown</italic>.</p>
<p><italic>QBA, qualitative behavior assessment; QD, qualitative descriptor; SQRT, square root transformed; log<sup>10</sup>, natural logarithm transformed; REV, reversed prior to transformation</italic>.</p>
</table-wrap-foot>
</table-wrap>
</sec>
<sec>
<title>Identifying Iceberg Indicators of Dairy Herd Welfare</title>
<p>Linear regressions revealed that 22 of the 96 welfare outcome measures were significantly associated with their respective composite welfare scales (<xref ref-type="table" rid="T4">Table 4</xref>). Most correlated in the expected direction; that is, most measures of poor welfare (e.g., dirty udders and coughs) correlated positively with the composite welfare scale, and most measures of good welfare (e.g., QD happy and QD content) correlated negatively. Although percentage cows ruminating, QD relaxed and QD calm appear to be exceptions, this was an artifact resulting from these variables being reversed during statistical transformation to correct for skewness.</p>
<table-wrap position="float" id="T4">
<label>Table 4</label>
<caption><p>Simple linear regression model results of the significant relationships between the individual welfare outcome measures and their respective composite welfare scales.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Welfare outcome measure</bold></th>
<th valign="top" align="left"><bold>Coefficient &#x0002B;/&#x02212; S.E</bold>.</th>
<th valign="top" align="center"><bold><italic>P</italic>-value</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">% cows with dirty udders log<sup>10</sup></td>
<td valign="top" align="left">1.637 &#x0002B;/&#x02212; 0.466</td>
<td valign="top" align="center">0.001</td>
</tr>
<tr>
<td valign="top" align="left">Mean no. coughs/cow/15 min SQRT</td>
<td valign="top" align="left">1.390 &#x0002B;/&#x02212; 0.497</td>
<td valign="top" align="center">0.008</td>
</tr>
<tr>
<td valign="top" align="left">QD relaxed REV log<sup>10</sup></td>
<td valign="top" align="left">1.384 &#x0002B;/&#x02212; 0.311</td>
<td valign="top" align="center">&#x0003C;0.001</td>
</tr>
<tr>
<td valign="top" align="left">% very lame cows log<sup>10</sup></td>
<td valign="top" align="left">1.058 &#x0002B;/&#x02212; 0.496</td>
<td valign="top" align="center">0.039</td>
</tr>
<tr>
<td valign="top" align="left">% cows dull and depressed</td>
<td valign="top" align="left">0.637 &#x0002B;/&#x02212; 0.256</td>
<td valign="top" align="center">0.017</td>
</tr>
<tr>
<td valign="top" align="left">% cows with very dirty hindquarters SQRT</td>
<td valign="top" align="left">0.316 &#x0002B;/&#x02212; 0.075</td>
<td valign="top" align="center">&#x0003C;0.001</td>
</tr>
<tr>
<td valign="top" align="left">% cows with swelling rest of body</td>
<td valign="top" align="left">0.289 &#x0002B;/&#x02212; 0.061</td>
<td valign="top" align="center">&#x0003C;0.001</td>
</tr>
<tr>
<td valign="top" align="left">QD uneasy</td>
<td valign="top" align="left">0.286 &#x0002B;/&#x02212; 0.080</td>
<td valign="top" align="center">0.001</td>
</tr>
<tr>
<td valign="top" align="left">QD calm REV SQRT</td>
<td valign="top" align="left">0.277 &#x0002B;/&#x02212; 0.055</td>
<td valign="top" align="center">&#x0003C;0.001</td>
</tr>
<tr>
<td valign="top" align="left">% cows with very dirty hind legs SQRT</td>
<td valign="top" align="left">0.188 &#x0002B;/&#x02212; 0.075</td>
<td valign="top" align="center">0.016</td>
</tr>
<tr>
<td valign="top" align="left">% cows with hind leg swelling SQRT</td>
<td valign="top" align="left">0.184 &#x0002B;/&#x02212; 0.089</td>
<td valign="top" align="center">0.046</td>
</tr>
<tr>
<td valign="top" align="left">% lame cows</td>
<td valign="top" align="left">0.074 &#x0002B;/&#x02212; 0.014</td>
<td valign="top" align="center">&#x0003C;0.001</td>
</tr>
<tr>
<td valign="top" align="left">Mean % cows ruminating</td>
<td valign="top" align="left">0.069 &#x0002B;/&#x02212; 0.016</td>
<td valign="top" align="center">&#x0003C;0.001</td>
</tr>
<tr>
<td valign="top" align="left">% cows with nasal discharge</td>
<td valign="top" align="left">0.034 &#x0002B;/&#x02212; 0.012</td>
<td valign="top" align="center">0.006</td>
</tr>
<tr>
<td valign="top" align="left">% cows with dirty hindquarters</td>
<td valign="top" align="left">0.022 &#x0002B;/&#x02212; 0.007</td>
<td valign="top" align="center">0.003</td>
</tr>
<tr>
<td valign="top" align="left">% cows with low protein at the first or second MR test day postpartum</td>
<td valign="top" align="left">0.018 &#x0002B;/&#x02212; 0.008</td>
<td valign="top" align="center">0.031</td>
</tr>
<tr>
<td valign="top" align="left">QD active</td>
<td valign="top" align="left">&#x02212;0.016 &#x0002B;/&#x02212; 0.005</td>
<td valign="top" align="center">0.006</td>
</tr>
<tr>
<td valign="top" align="left">QD positively occupied</td>
<td valign="top" align="left">&#x02212;0.033 &#x0002B;/&#x02212; 0.006</td>
<td valign="top" align="center">&#x0003C;0.001</td>
</tr>
<tr>
<td valign="top" align="left">Mean % cows feeding</td>
<td valign="top" align="left">&#x02212;0.035 &#x0002B;/&#x02212; 0.013</td>
<td valign="top" align="center">0.008</td>
</tr>
<tr>
<td valign="top" align="left">QD content</td>
<td valign="top" align="left">&#x02212;0.041 &#x0002B;/&#x02212; 0.007</td>
<td valign="top" align="center">&#x0003C;0.001</td>
</tr>
<tr>
<td valign="top" align="left">QD happy</td>
<td valign="top" align="left">&#x02212;0.061 &#x0002B;/&#x02212; 0.008</td>
<td valign="top" align="center">&#x0003C;0.001</td>
</tr>
<tr>
<td valign="top" align="left">QBA PC &#x0201C;contentedness&#x0201D;</td>
<td valign="top" align="left">&#x02212;0.571 &#x0002B;/&#x02212; 0.128</td>
<td valign="top" align="center">&#x0003C;0.001</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Welfare outcome measures are arranged in order of descending correlation and regression coefficients</italic>.</p>
<p><italic>QBA, qualitative behavior assessment; QD, qualitative descriptor; PC, principal component; SQRT, square root transformed; log<sup>10</sup>, natural logarithm transformed; REV, reversed prior to transformation</italic>.</p>
</table-wrap-foot>
</table-wrap>
<p><xref ref-type="table" rid="T4">Table 4</xref> shows that most correlations were with measures that had high loading onto PC 1, but a further five measures that could not be included in the PCA were also among those correlating with the composite welfare scale (these were: percentage cows dull and depressed, QD calm, percentage cows with hind leg swelling, percentage cows with low protein at the first or second MR test day postpartum, and the QBA &#x0201C;contentedness&#x0201D; principal component). Conversely, seven measures that had high loading on the composite welfare scale, did not show significant correlations with their respective composite welfare scales (these were: percentage cows with very dirty udders, percentage cows with dirty and very dirty hindlegs, QD apathetic, QD indifferent, QD lively, and QD friendly).</p>
<p>When the above 22 welfare outcomes were tested for their ability to predict composite welfare categories (i.e., each herd&#x00027;s quartile allocation), absolute agreement was at best only reasonable (<xref ref-type="table" rid="T5">Table 5</xref>). Most measures correctly classified &#x0003C;50% of the farms. Kappa statistics were often &#x0003C;0.2 [which indicates only &#x0201C;slight&#x0201D; agreement (Landis and Koch, <xref ref-type="bibr" rid="B50">1977</xref>)]. Agreement on the basis of Kendall&#x00027;s coefficient of concordance, which accounts for the magnitude of any misclassifications, was often &#x0003E;0.7 indicating &#x0201C;strong&#x0201D; agreement (Schmidt, <xref ref-type="bibr" rid="B70">1997</xref>). Overall, QD calm, QD happy and percentage of lame cows achieved the greatest level of agreement with their respective composite welfare categories. These all had Kendall&#x00027;s coefficients of concordance &#x0003E;0.7, and were the only measures to correctly classify (just) over 50% of farms and to obtain Cohen&#x00027;s Kappa statistics approaching &#x0003E;0.4 (the threshold indicating at least &#x0201C;moderate&#x0201D; agreement).</p>
<table-wrap position="float" id="T5">
<label>Table 5</label>
<caption><p>The relative performance of welfare outcome measures in predicting their respective composite welfare categories.</p></caption>
<table frame="hsides" rules="groups">
<thead><tr>
<th valign="top" align="left"><bold>Welfare outcome measure</bold></th>
<th valign="top" align="left"><bold>% correctly classified</bold></th>
<th valign="top" align="left"><bold>Cohen&#x00027;s Kappa</bold></th>
<th valign="top" align="left"><bold>Kendall&#x00027;s coefficient of concordance</bold></th>
</tr>
</thead>
<tbody>
<tr>
<td valign="top" align="left">QD content</td>
<td valign="top" align="left">48.8</td>
<td valign="top" align="left">0.32<xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;&#x0002A;&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.87<xref ref-type="table-fn" rid="TN3"><sup>&#x0002A;&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">QD happy</td>
<td valign="top" align="left">51.2</td>
<td valign="top" align="left">0.35<xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;&#x0002A;&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.87<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">QD calm REV SQRT</td>
<td valign="top" align="left">51.2</td>
<td valign="top" align="left">0.35<xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;&#x0002A;&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.81<xref ref-type="table-fn" rid="TN3"><sup>&#x0002A;&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">QD positively occupied</td>
<td valign="top" align="left">36.6</td>
<td valign="top" align="left">0.15<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.79<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">Mean % cows ruminating</td>
<td valign="top" align="left">41.5</td>
<td valign="top" align="left">0.22<xref ref-type="table-fn" rid="TN3"><sup>&#x0002A;&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.79<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">QD relaxed REV log<sup>10</sup></td>
<td valign="top" align="left">48.8</td>
<td valign="top" align="left">0.32<xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;&#x0002A;&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.78<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">% lame cows</td>
<td valign="top" align="left">53.7</td>
<td valign="top" align="left">0.38<xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;&#x0002A;&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.76<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">% cows with swelling rest of body</td>
<td valign="top" align="left">36.6</td>
<td valign="top" align="left">0.16<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.75<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">% cows with dirty udders log<sup>10</sup></td>
<td valign="top" align="left">41.5</td>
<td valign="top" align="left">0.22<xref ref-type="table-fn" rid="TN3"><sup>&#x0002A;&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.71<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">% cows with dirty hindquarters</td>
<td valign="top" align="left">29.3</td>
<td valign="top" align="left">0.06</td>
<td valign="top" align="left">0.70<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">% cows with very dirty hindquarters SQRT</td>
<td valign="top" align="left">39.0</td>
<td valign="top" align="left">0.19<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.70<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
</tr>
<tr>
<td valign="top" align="left">QBA PC &#x0201C;contentedness&#x0201D;</td>
<td valign="top" align="left">39.0</td>
<td valign="top" align="left">0.19<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.68</td>
</tr>
<tr>
<td valign="top" align="left">% very lame cows log<sup>10</sup></td>
<td valign="top" align="left">36.6</td>
<td valign="top" align="left">0.15<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.68</td>
</tr>
<tr>
<td valign="top" align="left">QD active</td>
<td valign="top" align="left">46.3</td>
<td valign="top" align="left">0.28<xref ref-type="table-fn" rid="TN2"><sup>&#x0002A;&#x0002A;&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.67</td>
</tr>
<tr>
<td valign="top" align="left">% cows with nasal discharge</td>
<td valign="top" align="left">31.7</td>
<td valign="top" align="left">0.09</td>
<td valign="top" align="left">0.67</td>
</tr>
<tr>
<td valign="top" align="left">% cows with low protein at the first or second MR test day postpartum</td>
<td valign="top" align="left">35.3</td>
<td valign="top" align="left">0.14</td>
<td valign="top" align="left">0.65</td>
</tr>
<tr>
<td valign="top" align="left">Mean % cows feeding</td>
<td valign="top" align="left">34.2</td>
<td valign="top" align="left">0.12</td>
<td valign="top" align="left">0.64</td>
</tr>
<tr>
<td valign="top" align="left">% cows with hind leg swelling SQRT</td>
<td valign="top" align="left">31.7</td>
<td valign="top" align="left">0.09</td>
<td valign="top" align="left">0.63</td>
</tr>
<tr>
<td valign="top" align="left">% cows dull and depressed</td>
<td valign="top" align="left">36.8</td>
<td valign="top" align="left">0.16<xref ref-type="table-fn" rid="TN4"><sup>&#x0002A;</sup></xref></td>
<td valign="top" align="left">0.63</td>
</tr>
<tr>
<td valign="top" align="left">% cows with very dirty hind legs SQRT</td>
<td valign="top" align="left">34.1</td>
<td valign="top" align="left">0.12</td>
<td valign="top" align="left">0.62</td>
</tr>
<tr>
<td valign="top" align="left">Mean no. coughs/cow/15 min SQRT</td>
<td valign="top" align="left">26.8</td>
<td valign="top" align="left">0.02</td>
<td valign="top" align="left">0.59</td>
</tr>
</tbody>
</table>
<table-wrap-foot>
<p><italic>Welfare outcome measures are arranged in descending order of Kendall&#x00027;s coefficient of concordance</italic>.</p>
<p><italic>QBA, qualitative behavior assessment; QD, qualitative descriptor; PC, principal component; SQRT, square root transformed; log<sup>10</sup>, natural logarithm transformed; REV, reversed prior to transformation</italic>.</p>
<fn id="TN2">
<label>&#x0002A;&#x0002A;&#x0002A;</label>
<p><italic>p &#x0003C;0.001</italic>,</p></fn>
<fn id="TN3">
<label>&#x0002A;&#x0002A;</label>
<p><italic>p &#x0003C;0.01</italic>,</p></fn>
<fn id="TN4">
<label>&#x0002A;</label>
<p><italic>p &#x0003C;0.05</italic>.</p></fn>
</table-wrap-foot>
</table-wrap>
</sec>
</sec>
<sec sec-type="discussion" id="s4">
<title>Discussion</title>
<p>In this study, two different and complementary methods have been explored in an attempt to increase the feasibility of a comprehensive animal welfare outcome assessment used on UK dairy farms. The welfare outcome assessment protocol showed &#x0201C;good&#x0201D; to &#x0201C;very good&#x0201D; intra-observer reliability for almost all the measures that could be tested, and the large numbers of apparently biologically meaningful correlations between measures suggests the protocol had good internal validity. However, the findings suggest limited capacity for effectively reducing the numbers of welfare outcomes included within this two-day assessment, for reasons that will be discussed separately for the two approaches below.</p>
<p>The study sample can be considered suitably representative of the wider UK dairy farm population with respect to farm management. In line with the wider population, most farms in the study had Holstein-Friesian or Holstein cows as their predominant breed, housed their cows in cubicles, and calved all-year-round. Median herd size (180 cows) and mean 305 day milk yield/cow/year (8,290.9 L) were broadly similar to, but slightly higher than, the UK averages at the time [herd size: 126; milk yield: 7,535 L (AHDB Dairy, <xref ref-type="bibr" rid="B1">2015a</xref>)]. With respect to welfare performance, both the median and range of percentage lame cows are broadly in line with recent UK prevalence estimates (Griffiths et al., <xref ref-type="bibr" rid="B41">2018</xref>; Randall et al., <xref ref-type="bibr" rid="B63">2019</xref>).</p>
<sec>
<title>Pairwise Associations Between Individual Welfare Outcome Measures</title>
<p>As with previous literature (M&#x000FC;lleder et al., <xref ref-type="bibr" rid="B57">2007</xref>; Mullan et al., <xref ref-type="bibr" rid="B56">2009b</xref>; de Vries et al., <xref ref-type="bibr" rid="B27">2013a</xref>), the results of the pairwise correlations reveal relatively little scope for reducing the number of welfare outcome measures included in assessment protocols for UK dairy herds. This is because, although many significant associations existed between measures, these associations were generally relatively weak. Just 17 high or very high associations were found between measures of the same &#x0201C;type&#x0201D; (e.g., QD relaxed vs. QD calm, median calving interval vs. median calving to conception interval and % cows with dirty udders vs. % cows with very dirty hindquarters), similar to previous work (de Vries et al., <xref ref-type="bibr" rid="B27">2013a</xref>). In these cases, replacing one of these measures with the other would be possible, but would generally have little or no impact on assessment implementation time. For example, the Welfare Quality<sup>&#x000AE;</sup> observation time for QBA is 20 min regardless of how many QD terms are used. From our results, there is no obvious welfare outcome measure that could be excluded on the basis of high pairwise associations to enable meaningful time-saving and increased efficiency.</p>
<p>It is perhaps not surprising to primarily find pairwise associations of only relatively limited strength. Firstly, relationships between measures are often not causal in nature. Instead welfare compromises, such as lameness, poor body condition, or abnormal behavior, generally have very complex multifactorial etiologies whereby they are influenced over time by multiple different (interacting) risk factors (e.g., Espejo and Endres, <xref ref-type="bibr" rid="B34">2007</xref>; Dippel et al., <xref ref-type="bibr" rid="B31">2009</xref>). Secondly, and fundamentally, welfare outcome measures are indirect manifestations of the subjective, multidimensional welfare experience of animals (e.g., Mason and Mendl, <xref ref-type="bibr" rid="B53">1993</xref>; Duncan, <xref ref-type="bibr" rid="B32">2005</xref>). They may differ in terms of their validity and the extent to which they represent particular dimensions of welfare (Rushen and Passill&#x000E9;, <xref ref-type="bibr" rid="B66">1992</xref>; Botreau et al., <xref ref-type="bibr" rid="B10">2007a</xref>). However, it is reassuring that the correlations were generally biologically plausible and meaningful in terms of their direction of correlation being consistent with either good or poor welfare (<xref ref-type="supplementary-material" rid="SM6">Supplementary Table 6</xref>), suggesting validity of the welfare assessment protocol.</p>
</sec>
<sec>
<title>Identifying Iceberg Indicators of Dairy Herd Welfare</title>
<p>Our results show that significant associations did exist between 22 individual welfare outcome measures and composite welfare scales that we used to approximate herd overall welfare status (<xref ref-type="table" rid="T4">Table 4</xref>). On the basis of the methods used, the measures QD happy, QD calm, and percentage of lame cows were arguably the best performing predictors. It is encouraging that a combination of these three measures captured positive and negative dimensions of welfare. However, on their own each individual measure only correctly classified around 50% of farms.</p>
<p>It is unclear how much this level of performance is a consequence of the genuine predictive ability of the measures or the methods used to determine predictive ability. In the absence of a universal gold standard for welfare scale, there appears to be an unavoidable element of circularity within the identification process. That is, to determine whether individual welfare outcome measures can predict herd overall welfare status, we first need a gold standard measure of overall welfare status, and the most valid way to create this is by using welfare outcome measures themselves (Rushen and Passill&#x000E9;, <xref ref-type="bibr" rid="B66">1992</xref>; Knierim and Winckler, <xref ref-type="bibr" rid="B48">2009</xref>). Ideally, the measures used to determine the overall welfare status would be derived independently of the measures being tested, and yet they would still need to describe the same animals within the same situation (so that the underlying subjective welfare state would be consistent). We attempted to avoid the problem highlighted by Heath et al. (<xref ref-type="bibr" rid="B43">2014a</xref>) by testing the ability of each measure to predict an composite welfare summary measure that excluded itself. However, by definition, a measure with any predictive ability must be highly associated with variables comprising the summary scale. We removed predetermined cases of such circularity (e.g., when testing percentage lame cows, we removed not only that measure but also percentage of very lame cows from the summary PCA welfare component). Nevertheless, the pairwise associations show that many less obvious measures also correlated with certain individual measures (<xref ref-type="supplementary-material" rid="SM6">Supplementary Table 6</xref>). However, if we had removed all of these correlated variables from the complementary welfare scales, then, inevitably, the individual measure would no longer have shown much predictive ability. It is difficult to develop an approach to determine herd overall welfare status that does not rely on the measures that are also being tested as candidate iceberg indicators, because multiple measures are needed for a comprehensive assessment of multi-dimensional welfare (Botreau et al., <xref ref-type="bibr" rid="B10">2007a</xref>,<xref ref-type="bibr" rid="B11">b</xref>).</p>
<p>If we accept the results provided by the method used in this study, it is noteworthy that several QD measures performed well in the analyses, whilst the QBA components did less well. Other studies to date have seemingly not explored QDs independently, because they are not intended for use alone, but the predictive value of QBA has been investigated. Heath et al. (<xref ref-type="bibr" rid="B43">2014a</xref>) found that the QBA component of the Welfare Quality protocol was reasonably good at predicting the assessment&#x00027;s overall classification result. Furthermore, de Vries et al. (<xref ref-type="bibr" rid="B27">2013a</xref>) found that non-QBA aspects of the Welfare Quality<sup>&#x000AE;</sup> protocol were moderately good at predicting farm performance with respect to QBA&#x02014;in fact this was the best predicted aspect of the protocol. QBA has been described as a potentially highly &#x0201C;integrative&#x0201D; welfare assessment tool (Wemelsfelder et al., <xref ref-type="bibr" rid="B87">2001</xref>), because it is intended to summarize all observed aspects of animal behavior and physical condition into terms describing the animals&#x00027; subjective experience (Wemelsfelder, <xref ref-type="bibr" rid="B85">2007</xref>). It also provides measures of positive welfare that are difficult to capture by other methods. However, Andreasen et al. (<xref ref-type="bibr" rid="B3">2013</xref>) found no relationship between QBA and herd overall welfare status (using the Welfare Quality<sup>&#x000AE;</sup> overall classification result), and other studies investigating relationships between QBA and other aspects of welfare are mixed (Heath et al., <xref ref-type="bibr" rid="B43">2014a</xref>). Furthermore, questions exist around both the validity and reliability of QBA because it is so subjective (Wemelsfelder et al., <xref ref-type="bibr" rid="B86">2000</xref>, <xref ref-type="bibr" rid="B87">2001</xref>; Bokkers et al., <xref ref-type="bibr" rid="B9">2012</xref>). In our study, QD happy was the only measure not to attain &#x0201C;fair&#x0201D; (or above) intra-observer reliability at all three timepoints, despite having amongst the best predictive ability. We cannot assume that the predictive value of the QD or QBA assessments would have been similar with another assessor, and perhaps the conflicting results in the aforementioned studies are due to inter-assessor variation. Inter-observer agreement regarding QDs and QBA is clearly an area for continued research, because QBA reliability has previously been found to be poor in some studies (Bokkers et al., <xref ref-type="bibr" rid="B9">2012</xref>; Winckler, <xref ref-type="bibr" rid="B92">2014</xref>). In order for any potential iceberg indicators to be of use in an applied setting, such as for legal or assurance purposes, they will need to display an appropriate level of intra- and inter-observer reliability. We would therefore recommend similar studies are attempted with different and a larger number of assessors, to investigate whether the present findings for QD calm and happy&#x02014;as well as the other measures featured in our assessment protocol&#x02014;can be replicated when other assessors are used.</p>
<p>It should also be noted that, in the present study, we treated the individual QD terms (such as QD calm and QD happy) as individual welfare outcome measures in their own right, alongside our PCA generated summary measures of QBA (&#x0201C;contentedness,&#x0201D; &#x0201C;sociability,&#x0201D; and &#x0201C;agitation&#x0201D;), in case any were found to be redundant. Existing work on the validation and reliability of the QBA approach has generally focused on the resulting PCA summary measures, rather than the individual descriptor terms themselves. Interestingly, our results do appear to suggest that the individual QD terms are able to capture welfare relevant information and, for the most part, provide fairly good levels of (intra) observer agreement. However, as suggested above, it will be important to ensure these findings can be replicated beyond the present study.</p>
<p>Lameness prevalence also performed well in our analyses. Lameness indicates pain (Whay et al., <xref ref-type="bibr" rid="B90">2005</xref>), and is thus highly welfare relevant. It can be prevalent on UK farms (Barker et al., <xref ref-type="bibr" rid="B5">2010</xref>) and, consequently, it is frequently cited as among the most important welfare measures for dairy cattle (Whay et al., <xref ref-type="bibr" rid="B89">2003b</xref>). Consistent with our findings, de Vries et al. (<xref ref-type="bibr" rid="B27">2013a</xref>) found that non-lameness aspects of the Welfare Quality<sup>&#x000AE;</sup> protocol were moderately good at predicting the prevalence of (severely) lame cows&#x02014;this was the second best predicted aspect of the protocol after QBA. Its predictive ability may be due to lameness having a multifactorial etiology, reflecting the general quality of farm management, environment and stockmanship (Dippel et al., <xref ref-type="bibr" rid="B31">2009</xref>; Rutherford et al., <xref ref-type="bibr" rid="B67">2009</xref>; Barker et al., <xref ref-type="bibr" rid="B5">2010</xref>), and reflecting that pain thresholds are affected by mood (in humans and rodent models at least: Wiech and Tracey, <xref ref-type="bibr" rid="B91">2009</xref>).</p>
<p>It is interesting to note that the findings of Sandgren et al. (<xref ref-type="bibr" rid="B68">2009</xref>) and Nyman et al. (<xref ref-type="bibr" rid="B59">2011</xref>), which described good predictive ability of welfare outcome measures related to mortality and fertility, were not replicated in our study. None of the mortality and fertility measures investigated here were significantly associated with their respective composite welfare scales (although they did show significant pairwise associations with certain other measures, e.g., <xref ref-type="supplementary-material" rid="SM6">Supplementary Table 6</xref>). Reasons for the discrepancy between studies are difficult to discern, as there were many differences in methods and sampling.</p>
<p>Other measures of welfare that are usually considered important and might have served as good iceberg indicators, including rumination, lying behavior, body condition, and vulval discharge (e.g., FAWC, <xref ref-type="bibr" rid="B36">2009</xref>), did not perform especially well in this study. Body condition and vulval discharge varies considerably with lactation stage, but the farms in the present study exhibited a range of calving patterns and, therefore, herd stage of lactation was inconsistent across farms. Farms visited when cows were most &#x0201C;eligible&#x0201D; to have poor body condition/vulval discharge are likely to have more reliable estimates for these welfare outcomes than farms visited at a different time. In future, ideally measures of body condition/vulval discharge that take account of cow stage of lactation would be used, or, if this is not possible, the impact of stage of lactation could be investigated and possibly accounted for in the statistical analyses. Percentage of cows ruminating showed good predictive ability, but in an unexpected direction: higher percentages of cows ruminating significantly predicted measures of poorer welfare, whereas rumination is normally <italic>reduced</italic> with poor welfare conditions [e.g., metabolic disorders (Stangaferro et al., <xref ref-type="bibr" rid="B71">2016a</xref>), severe metritis (Stangaferro et al., <xref ref-type="bibr" rid="B73">2016c</xref>), and mastitis caused by <italic>E. coli</italic> (Stangaferro et al., <xref ref-type="bibr" rid="B72">2016b</xref>), but seemingly not with lameness (Walker et al., <xref ref-type="bibr" rid="B81">2008</xref>; Thorup et al., <xref ref-type="bibr" rid="B74">2016</xref>)]. This unexpected finding might be an artifact of how we measured rumination, because the scan sampling section of the protocol started directly after morning feed delivery or return from milking. This means that cows were likely to be feeding at that time, rather than ruminating, and feeding and rumination were mutually exclusive behaviors (Schirmann et al., <xref ref-type="bibr" rid="B69">2012</xref>). This is supported by the fact that % time spent ruminating correlated negatively with % time spent feeding (<xref ref-type="supplementary-material" rid="SM6">Supplementary Table 6</xref>), and greater % time spent feeding (in the hours following feed delivery) was significantly associated with better welfare on the composite welfare scale (<xref ref-type="table" rid="T4">Table 4</xref>). Farms are increasingly adopting rumination and activity monitoring, so continuous measures for both these will probably greatly assist further research in this area (Stangaferro et al., <xref ref-type="bibr" rid="B71">2016a</xref>).</p>
</sec>
<sec>
<title>Method for Determining Herd Overall Welfare Status</title>
<p>The validity of the developed composite welfare scale(s) as a proxy for the herds&#x00027; genuine overall welfare status was central to our attempts to identify iceberg indicators of dairy herd welfare. The use of PCA to aggregate measures based on their existing inter-relationships is a potentially more valid approach than, for example, the use of predetermined aggregation rules. There is currently very little scientific evidence on which to base such rules (e.g., relative weightings), so their use can lead to unexpected/unintentional aggregation results (de Vries et al., <xref ref-type="bibr" rid="B27">2013a</xref>; Heath et al., <xref ref-type="bibr" rid="B43">2014a</xref>; Buijs et al., <xref ref-type="bibr" rid="B16">2016</xref>). Furthermore, the composite welfare scale provided a relatively comprehensive &#x0201C;overall&#x0201D; assessment of welfare because it incorporated a relatively large number of different measures (e.g., QBA of herd behavior, lameness, cleanliness, swellings, nasal discharge, coughing, rumination, and feeding behavior). However, some of the measures included in the PCA (e.g., abrasions and social behavior) were not well-represented by the composite welfare scale (PC 1). This does not necessarily mean that they did not help measure welfare and are therefore unimportant. On the contrary, they may be particularly important to retain within welfare assessment protocols precisely because they captured different aspects of welfare from the measures that loaded onto PC 1 (which after all did only explain 17% of the total variation). Aggregations <italic>via</italic> PCA are purely correlational and may not all be biologically meaningful, so whether using data derived weightings, theory driven weightings, or no weightings at all, any approach could have unintended consequences if it led to the wrong measures being retained or excluded. A third consideration about the validity of the PCA method for summarizing welfare is that some measures could not be included in the aggregation process, either because their data type was unsuitable for PCA (e.g., the milk recording data generated measures) or because they could not be collected in the first place (e.g., prevalence/incidence of mastitis, dystocia etc.). It is possible, therefore, that the composite welfare scale describes particular aspects of herd welfare as opposed to the herds&#x00027; genuine overall welfare status. Ultimately, however, if welfare assessment protocols can be improved such that the individual measures are more suitable for inclusion within PCA, the developed composite welfare scale offers an alternative to predetermined aggregation methods, and is a promising proxy measure of herd overall welfare status.</p>
<p>Within this study we opted to use PC1 to create a measure of the overall herd welfare status, because it explained the most variation (albeit only 17%), and its loadings were consistent with an animal welfare interpretation. However, it is possible that other approaches could have been used to summarize the most important loadings on more than one principal component, although information might be lost through this selective method. Also, the precise method for unifying these loadings into a single value per farm could introduce the aforementioned difficulty of how variables from different components would need to be weighted.</p>
<p>We recommend that the validity of aggregation <italic>via</italic> PCA is further reviewed by investigating whether similar PCA results are achieved if the welfare outcome assessment protocol is repeated, for example, on different farms and/or if more welfare outcome measures are included in the analysis. Also, some of the measurement protocols should be reviewed to improve the likelihood of variables being suitable for inclusion within PCAs in future. For example, measurement protocols for variables that generated excessive zeros in this study, could be adjusted to lower the threshold for noting presence of the criterion being measured, or the timing of observations could be improved to better capture that measure.</p>
</sec>
<sec>
<title>Method for Identifying Iceberg Indicators of Dairy Herd Welfare</title>
<p>There are two potential limitations to converting farm welfare performance from a continuous scale into categories, approximating an applied rating (e.g., Welfare Quality, <xref ref-type="bibr" rid="B84">2009</xref>). Firstly, the conversion will inevitably have resulted in a certain amount of information loss. That is, the use of four &#x0201C;welfare performance categories&#x0201D; provides less detail than the true variation in welfare performance observed across farms. Secondly, the (quartile value) thresholds used to determine farm category membership are arbitrary and specific to the farms sampled, rather than providing absolute standards. Some studies have since also used quartiles, although the authors did not distinguish all four quartiles, instead denoting the worst quartile as that indicating &#x0201C;poor&#x0201D; animal welfare on the farms falling within it, whilst the remaining three quartiles denoted &#x0201C;acceptable&#x0201D; animal welfare (e.g., de Vries et al., <xref ref-type="bibr" rid="B29">2016</xref>; van Staaveren et al., <xref ref-type="bibr" rid="B77">2017</xref>). If we had created only two categories, the kappa agreement ratings would almost certainly have been higher than they were (because there is less scope for error with fewer options). The use of four categories does serve to describe the farms&#x00027; relative welfare performance more appropriately overall (farms in the first category did perform differently from farms in the fourth category), but the precise thresholds may not have been meaningful in terms of distinguishing &#x0201C;poor,&#x0201D; &#x0201C;acceptable,&#x0201D; &#x0201C;good,&#x0201D; or &#x0201C;excellent&#x0201D; animal welfare. Choice of threshold does influence how well different measures perform (Sandgren et al., <xref ref-type="bibr" rid="B68">2009</xref>), so the relative predictive ability of the different measures could change if different thresholds were used. A challenge, however, will be identifying the most appropriate thresholds for benchmarking (Mendl, <xref ref-type="bibr" rid="B54">1991</xref>; Botreau et al., <xref ref-type="bibr" rid="B10">2007a</xref>).</p>
<p>In this study, we used agreement statistics to test the predictive ability of each individual outcome measure with regards to the farm welfare categories, but other approaches could be used. For example, discriminant analysis could have been used to identify important outcomes loading onto the quartiles identified by PCA (Presi and Reist, <xref ref-type="bibr" rid="B60">2011</xref>). The results would still have been affected by where the category thresholds were defined, but discriminant analysis could be an efficient approach for future studies.</p>
</sec>
</sec>
<sec sec-type="conclusions" id="s5">
<title>Conclusion</title>
<p>Overall, we found a large number of associations between the different welfare outcome measures included in our assessment protocol. However, most pairwise associations were weak to moderate, and existed between highly related measures, so there appears to be relatively little scope for excluding individual measures from assessment protocols based on their pairwise relationships. Linear regression analysis revealed that 22 measures were significantly associated with their respective composite welfare scale. Subsequent analysis of their ability to predict the quartile classification of herds revealed that, of these, QD calm, QD happy and percentage of lame cows were the best performing measures, although their predictive ability was only moderately good. These measures may therefore be regarded as potential iceberg indicators capturing both positive and negative aspects of dairy herd welfare. Further research using the same methodological approach with a new sample of farms, multiple assessors to investigate inter-observer reliability, and improvement of certain individual welfare outcome measures is needed to test the external validity of the statistical methods used, and to confirm or refute our findings.</p>
<p>Until valid and reliable approaches that reduce the time required to perform effective welfare assessments are developed, it remains necessary to complete full welfare assessments, ensuring that animal welfare issues are not missed and appropriate standards are recognized.</p>
</sec>
<sec sec-type="data-availability" id="s6">
<title>Data Availability Statement</title>
<p>The original contributions presented in the study are included in the article/<xref ref-type="supplementary-material" rid="s11">Supplementary Material</xref>, further inquiries can be directed to the corresponding author.</p>
</sec>
<sec id="s7">
<title>Ethics Statement</title>
<p>The animal study was reviewed and approved by the Royal Veterinary College Ethics and Welfare Committee (Approval number: 2013 1236). Written informed consent was obtained from the owners for the participation of their animals in this study.</p>
</sec>
<sec id="s8">
<title>Author Contributions</title>
<p>SC, CB, CW, JC, and NB: contributed to conception and design of the study. SC: collected data, organized the database, performed the statistical analysis, and wrote the first draft of the manuscript. CB, CW, JC, and NB: supervised the project. Y-MC: gave statistical advice. CB: wrote sections of the manuscript. All authors contributed to manuscript revision, read, and approved the submitted version.</p>
</sec>
<sec sec-type="funding-information" id="s9">
<title>Funding</title>
<p>This project was funded <italic>via</italic> a BBSRC/RVC CASE studentship (VMG42) with AHDB Dairy (then named Dairy Co.).</p>
</sec>
<sec sec-type="COI-statement" id="conf1">
<title>Conflict of Interest</title>
<p>The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.</p>
</sec>
<sec sec-type="disclaimer" id="s10">
<title>Publisher&#x00027;s Note</title>
<p>All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.</p>
</sec>
</body>
<back>
<ack><p>We would like to thank Jenny Gibbons and the AHDB Dairy team for guidance and advice throughout the project. We are extremely grateful to staff at the 51 farms that participated in the cross-sectional study. Special thanks also go to all the following people and teams. Natalie Chancellor provided technical assistance with the development of the cross-sectional study intra-observer reliability tests, and the farm records review process. The AHDB Dairy Health, Welfare and Nutrition Research Partnership (particularly Cheryl Heath, David Main, Siobhan Mullan, and Marie Haskell), and the RVC Farm Animal Health and Production group (particularly Richard Booth), gave feedback on the development of the welfare outcome assessment protocol and the iceberg indicator aspects of the project. Paul Christian, Charlie Verity, and Graeme Webster at the RVC Farm, gave feedback on various aspects of the project and enabled pilot testing of the welfare outcome assessment protocol. Jo Speed provided AHDB Dairy Mobility Score training, and members of the Welfare Quality<sup>&#x000AE;</sup> consortium (particularly Christoph Winckler and Marlene Kirchner) provided Welfare Quality<sup>&#x000AE;</sup> protocol training. The Milkbench&#x0002B; team (particularly Karolina Klaskova) assisted with the Milkbench&#x0002B; profitability benchmarking scheme data. The British Cattle Movement Service provided movement data for the cross-sectional study farms.</p>
</ack>
<sec sec-type="supplementary-material" id="s11">
<title>Supplementary Material</title>
<p>The Supplementary Material for this article can be found online at: <ext-link ext-link-type="uri" xlink:href="https://www.frontiersin.org/articles/10.3389/fanim.2021.703380/full#supplementary-material">https://www.frontiersin.org/articles/10.3389/fanim.2021.703380/full#supplementary-material</ext-link></p>
<supplementary-material xlink:href="Table_1.docx" id="SM1" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_2.docx" id="SM2" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_3.docx" id="SM3" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_4.docx" id="SM4" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_5.docx" id="SM5" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_6.docx" id="SM6" mimetype="application/vnd.openxmlformats-officedocument.wordprocessingml.document" xmlns:xlink="http://www.w3.org/1999/xlink"/>
<supplementary-material xlink:href="Table_7.xlsx" id="SM7" mimetype="application/vnd.openxmlformats-officedocument.spreadsheetml.sheet" xmlns:xlink="http://www.w3.org/1999/xlink"/>
</sec>
<ref-list>
<title>References</title>
<ref id="B1">
<citation citation-type="book"><person-group person-group-type="author"><collab>AHDB Dairy</collab></person-group> (<year>2015a</year>). <source>Dairy Statistics: An Insider&#x00027;s Guide 2015</source>, <publisher-loc>Kenilworth</publisher-loc>.</citation>
</ref>
<ref id="B2">
<citation citation-type="web"><person-group person-group-type="author"><collab>AHDB Dairy</collab></person-group> (<year>2015b</year>). <source>Mobility Scoring for Dairy Cows</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://dairy.ahdb.org.uk/resources-library/technical-information/health-welfare/mobility-score-instructions/&#x00023;.VhdFareFPcs">http://dairy.ahdb.org.uk/resources-library/technical-information/health-welfare/mobility-score-instructions/&#x00023;.VhdFareFPcs</ext-link> (accessed October 07, 2015).</citation>
</ref>
<ref id="B3">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Andreasen</surname> <given-names>S. N.</given-names></name> <name><surname>Wemelsfelder</surname> <given-names>F.</given-names></name> <name><surname>Sand&#x000F8;e</surname> <given-names>P.</given-names></name> <name><surname>Forkman</surname> <given-names>B.</given-names></name></person-group> (<year>2013</year>). <article-title>The correlation of Qualitative Behavior Assessments with Welfare Quality<sup>&#x000AE;</sup> protocol outcomes in on-farm welfare assessment of dairy cattle</article-title>. <source>Appl. Anim. Behav. Sci.</source> <volume>143</volume>, <fpage>9</fpage>&#x02013;<lpage>17</lpage>. <pub-id pub-id-type="doi">10.1016/j.applanim.2012.11.013</pub-id></citation>
</ref>
<ref id="B4">
<citation citation-type="web"><person-group person-group-type="author"><collab>AssureWel</collab></person-group> (<year>2015</year>). <source>Dairy Cows</source>. Available online at: <ext-link ext-link-type="uri" xlink:href="http://www.assurewel.org/dairycows">http://www.assurewel.org/dairycows</ext-link> (accessed October 07, 2015).</citation>
</ref>
<ref id="B5">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Barker</surname> <given-names>Z. E.</given-names></name> <name><surname>Leach</surname> <given-names>K. A.</given-names></name> <name><surname>Whay</surname> <given-names>H. R.</given-names></name> <name><surname>Bell</surname> <given-names>N. J.</given-names></name> <name><surname>Main</surname> <given-names>D. C. J.</given-names></name></person-group> (<year>2010</year>). <article-title>Assessment of lameness prevalence and associated risk factors in dairy herds in England and Wales</article-title>. <source>J. Dairy Sci.</source> <volume>93</volume>, <fpage>932</fpage>&#x02013;<lpage>941</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2009-2309</pub-id><pub-id pub-id-type="pmid">20172213</pub-id></citation></ref>
<ref id="B6">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bender</surname> <given-names>R.</given-names></name> <name><surname>Lange</surname> <given-names>S.</given-names></name></person-group> (<year>2001</year>). <article-title>Adjusting for multiple testing - when and how?</article-title> <source>J. Clin. Epidemiol.</source> <volume>54</volume>, <fpage>343</fpage>&#x02013;<lpage>349</lpage>. <pub-id pub-id-type="doi">10.1016/S0895-4356(00)00314-0</pub-id><pub-id pub-id-type="pmid">11297884</pub-id></citation></ref>
<ref id="B7">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Berckmans</surname> <given-names>D.</given-names></name></person-group> (<year>2014</year>). <article-title>Precision livestock farming technologies for welfare management in intensive livestock systems</article-title>. <source>Revue Scientifique et Technique</source> <volume>33</volume>, <fpage>189</fpage>&#x02013;<lpage>198</lpage>. <pub-id pub-id-type="doi">10.20506/rst.33.1.2273</pub-id><pub-id pub-id-type="pmid">25000791</pub-id></citation></ref>
<ref id="B8">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Blokhuis</surname> <given-names>H. J.</given-names></name> <name><surname>Veissier</surname> <given-names>I.</given-names></name> <name><surname>Miele</surname> <given-names>M.</given-names></name> <name><surname>Jones</surname> <given-names>B.</given-names></name></person-group> (<year>2010</year>). <article-title>The Welfare Quality<sup>&#x000AE;</sup> project and beyond: Safeguarding farm animal well-being</article-title>. <source>Acta Agri. Scand. A Anim. Sci.</source> <volume>60</volume>, <fpage>129</fpage>&#x02013;<lpage>140</lpage>. <pub-id pub-id-type="doi">10.1080/09064702.2010.523480</pub-id></citation>
</ref>
<ref id="B9">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bokkers</surname> <given-names>E. A. M.</given-names></name> <name><surname>de Vries</surname> <given-names>M.</given-names></name> <name><surname>Antonissen</surname> <given-names>I.</given-names></name> <name><surname>de Boer</surname> <given-names>I. J. M.</given-names></name></person-group> (<year>2012</year>). <article-title>Inter- and intra-observer reliability of experienced and inexperienced observers for the Qualitative Behaviour Assessment in dairy cattle</article-title>. <source>Anim. Welfare</source> <volume>21</volume>, <fpage>307</fpage>&#x02013;<lpage>318</lpage>. <pub-id pub-id-type="doi">10.7120/09627286.21.3.307</pub-id></citation>
</ref>
<ref id="B10">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Botreau</surname> <given-names>R.</given-names></name> <name><surname>Bonde</surname> <given-names>M.</given-names></name> <name><surname>Butterworth</surname> <given-names>A.</given-names></name> <name><surname>Perny</surname> <given-names>P.</given-names></name> <name><surname>Bracke</surname> <given-names>M. B. M.</given-names></name> <name><surname>Capdeville</surname> <given-names>J.</given-names></name> <etal/></person-group>. (<year>2007a</year>). <article-title>Aggregation of measures to produce an overall assessment of animal welfare. Part 1: a review of existing methods</article-title>. <source>Animal</source> <volume>1</volume>, <fpage>1179</fpage>&#x02013;<lpage>1187</lpage>. <pub-id pub-id-type="doi">10.1017/S1751731107000535</pub-id><pub-id pub-id-type="pmid">22444862</pub-id></citation></ref>
<ref id="B11">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Botreau</surname> <given-names>R.</given-names></name> <name><surname>Bracke</surname> <given-names>M. B. M.</given-names></name> <name><surname>Perny</surname> <given-names>P.</given-names></name> <name><surname>Butterworth</surname> <given-names>A.</given-names></name> <name><surname>Capdeville</surname> <given-names>J.</given-names></name> <name><surname>Van Reenen</surname> <given-names>C. G.</given-names></name> <etal/></person-group>. (<year>2007b</year>). <article-title>Aggregation of measures to produce an overall assessment of animal welfare. Part 2: analysis of constraints</article-title>. <source>Animal</source> <volume>1</volume>, <fpage>1188</fpage>&#x02013;<lpage>1197</lpage>. <pub-id pub-id-type="doi">10.1017/S1751731107000547</pub-id><pub-id pub-id-type="pmid">22444863</pub-id></citation></ref>
<ref id="B12">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Botreau</surname> <given-names>R.</given-names></name> <name><surname>Capdeville</surname> <given-names>J.</given-names></name> <name><surname>Perny</surname> <given-names>P.</given-names></name> <name><surname>Veissier</surname> <given-names>I.</given-names></name></person-group> (<year>2008</year>). <article-title>Multicriteria evaluation of animal welfare at farm level: an application of MCDA methodologies foundations of computing and decision</article-title>. <source>Sciences</source> <volume>31</volume>, <fpage>287</fpage>&#x02013;<lpage>316</lpage>.</citation>
</ref>
<ref id="B13">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Botreau</surname> <given-names>R.</given-names></name> <name><surname>Veissier</surname> <given-names>I.</given-names></name> <name><surname>Perny</surname> <given-names>P.</given-names></name></person-group> (<year>2009</year>). <article-title>Overall assessment of animal welfare: strategy adopted in Welfare Quality<sup>&#x000AE;</sup></article-title>. <source>Anim. Welfare</source> <volume>18</volume>, <fpage>363</fpage>&#x02013;<lpage>370</lpage>.</citation>
</ref>
<ref id="B14">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bracke</surname> <given-names>M. B. M.</given-names></name> <name><surname>Metz</surname> <given-names>J. H. M.</given-names></name> <name><surname>Spruijt</surname> <given-names>B. M.</given-names></name> <name><surname>Schouten</surname> <given-names>W. G. P.</given-names></name></person-group> (<year>2002</year>). <article-title>Decision support system for overall welfare assessment in pregnant sows B: validation by expert opinion</article-title>. <source>J. Anim. Sci.</source> <volume>80</volume>, <fpage>1835</fpage>&#x02013;<lpage>1845</lpage>. <pub-id pub-id-type="doi">10.2527/2002.8071835x</pub-id><pub-id pub-id-type="pmid">12162650</pub-id></citation></ref>
<ref id="B15">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Brouwer</surname> <given-names>H.</given-names></name> <name><surname>Stegeman</surname> <given-names>J. A.</given-names></name> <name><surname>Straatsma</surname> <given-names>J. W.</given-names></name> <name><surname>Hooijer</surname> <given-names>G. A.</given-names></name> <name><surname>Schaik</surname> <given-names>G.v.</given-names></name></person-group> (<year>2015</year>). <article-title>The validity of a monitoring system based on routinely collected dairy cattle health data relative to a standardized herd check</article-title>. <source>Prev. Vet. Med.</source> <volume>122</volume>, <fpage>76</fpage>&#x02013;<lpage>82</lpage>. <pub-id pub-id-type="doi">10.1016/j.prevetmed.2015.09.009</pub-id><pub-id pub-id-type="pmid">26472123</pub-id></citation></ref>
<ref id="B16">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Buijs</surname> <given-names>S.</given-names></name> <name><surname>Ampe</surname> <given-names>B.</given-names></name> <name><surname>Tuyttens</surname> <given-names>F. A. M.</given-names></name></person-group> (<year>2016</year>). <article-title>Sensitivity of the Welfare Quality<sup>&#x000AE;</sup> broiler chicken protocol to differences between intensively reared indoor flocks: which factors explain overall classification?</article-title> <source>Animal</source> <volume>11</volume>, <fpage>244</fpage>&#x02013;<lpage>253</lpage>. <pub-id pub-id-type="doi">10.1017/S1751731116001476</pub-id><pub-id pub-id-type="pmid">27416919</pub-id></citation></ref>
<ref id="B17">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Calamari</surname> <given-names>L.</given-names></name> <name><surname>Bertoni</surname> <given-names>G.</given-names></name></person-group> (<year>2009</year>). <article-title>Model to evaluate welfare in dairy cow farms</article-title>. <source>Italian J. Anim Sci</source>. <volume>2009</volume>:<fpage>23</fpage>. <pub-id pub-id-type="doi">10.4081/ijas.2009.s1.301</pub-id></citation>
</ref>
<ref id="B18">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Capdeville</surname> <given-names>J.</given-names></name> <name><surname>Veissier</surname> <given-names>I.</given-names></name></person-group> (<year>2001</year>). <article-title>A method of assessing welfare in loose housed dairy cows at farm level, focusing on animal observations</article-title>. <source>Acta Agri. Scandi. A Anim. Sci.</source> <volume>51</volume>, <fpage>62</fpage>&#x02013;<lpage>68</lpage>. <pub-id pub-id-type="doi">10.1080/090647001316923081</pub-id></citation>
</ref>
<ref id="B19">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chapinal</surname> <given-names>N.</given-names></name> <name><surname>de Passill&#x000E9;</surname> <given-names>A. M.</given-names></name> <name><surname>Weary</surname> <given-names>D. M.</given-names></name> <name><surname>von Keyserlingk</surname> <given-names>M. A. G.</given-names></name> <name><surname>Rushen</surname> <given-names>J.</given-names></name></person-group> (<year>2009</year>). <article-title>Using gait score, walking speed, and lying behavior to detect hoof lesions in dairy cows</article-title>. <source>J. Dairy Sci.</source> <volume>92</volume>, <fpage>4365</fpage>&#x02013;<lpage>4374</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2009-2115</pub-id><pub-id pub-id-type="pmid">19700696</pub-id></citation></ref>
<ref id="B20">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Cicchetti</surname> <given-names>D. V.</given-names></name></person-group> (<year>1994</year>). <article-title>Guidelines, criteria, and rules of thumb for evaluating normed and standardized assessment instruments in psychology</article-title>. <source>Psychol Asses.</source> <volume>6</volume>:<fpage>284</fpage>&#x02013;<lpage>290</lpage>. <pub-id pub-id-type="doi">10.1037/1040-3590.6.4.284</pub-id></citation>
</ref>
<ref id="B21">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Collins</surname> <given-names>S.</given-names></name></person-group> (<year>2016a</year>). <article-title>&#x0201C;Chapter 3: investigating UK dairy farmer and cattle vet definitions of animal welfare and preferences for using different welfare outcomes,&#x0201D;</article-title> in <source>An Investigation of Whether and How Welfare Outcome Assessment Could Be Better Used by UK Dairy Farmers</source>. (PhD Thesis), <publisher-name>University of London</publisher-name>, <publisher-loc>London, United Kingdom</publisher-loc>, <fpage>49</fpage>&#x02013;<lpage>82</lpage>.</citation>
</ref>
<ref id="B22">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Collins</surname> <given-names>S.</given-names></name></person-group> (<year>2016b</year>). <article-title>&#x0201C;Chapter 4: exploring the possibility of improving welfare outcome assessment feasibility by optimising the number of measures included in assessment protocols for dairy herds,&#x0201D;</article-title> in <source>An Investigation of Whether and How Welfare Outcome Assessment Could Be Better Used by UK Dairy Farmers</source>. (PhD Thesis), <publisher-name>University of London</publisher-name>, <publisher-loc>London, United Kingdom</publisher-loc>, <fpage>83</fpage>&#x02013;<lpage>134</lpage>.</citation>
</ref>
<ref id="B23">
<citation citation-type="book"><person-group person-group-type="author"><collab>DairyCo</collab></person-group> (<year>2014</year>). <source>Evidence Report: Analysis of the Milkbench</source>&#x0002B; <italic>and International Dairy Benchmarking Data for 2012/13</italic>, <publisher-loc>Kenilworth</publisher-loc>.</citation>
</ref>
<ref id="B24">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dawkins</surname> <given-names>M. S.</given-names></name></person-group> (<year>2006</year>). <article-title>A user&#x00027;s guide to animal welfare science</article-title>. <source>Trends Ecol. Evol.</source> <volume>21</volume>, <fpage>77</fpage>&#x02013;<lpage>82</lpage>. <pub-id pub-id-type="doi">10.1016/j.tree.2005.10.017</pub-id><pub-id pub-id-type="pmid">16701478</pub-id></citation></ref>
<ref id="B25">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Jong</surname> <given-names>I. C.</given-names></name> <name><surname>Hindle</surname> <given-names>V. A.</given-names></name> <name><surname>Butterworth</surname> <given-names>A.</given-names></name> <name><surname>Engel</surname> <given-names>B.</given-names></name> <name><surname>Ferrari</surname> <given-names>P.</given-names></name> <name><surname>Gunnink</surname> <given-names>H.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Simplifying the Welfare Quality<sup>&#x000AE;</sup> assessment protocol for broiler chicken welfare</article-title>. <source>Animal</source> <volume>10</volume>, <fpage>117</fpage>&#x02013;<lpage>127</lpage>. <pub-id pub-id-type="doi">10.1017/S1751731115001706</pub-id><pub-id pub-id-type="pmid">26306882</pub-id></citation></ref>
<ref id="B26">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Vries</surname> <given-names>M.</given-names></name> <name><surname>Bokkers</surname> <given-names>E. A. M.</given-names></name> <name><surname>Dijkstra</surname> <given-names>T.</given-names></name> <name><surname>van Schaik</surname> <given-names>G.</given-names></name> <name><surname>de Boer</surname> <given-names>I. J. M.</given-names></name></person-group> (<year>2011</year>). <article-title>Invited review: associations between variables of routine herd data and dairy cattle welfare indicators</article-title>. <source>J. Dairy Sci.</source> <volume>94</volume>, <fpage>3213</fpage>&#x02013;<lpage>3228</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2011-4169</pub-id><pub-id pub-id-type="pmid">21700006</pub-id></citation></ref>
<ref id="B27">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Vries</surname> <given-names>M.</given-names></name> <name><surname>Bokkers</surname> <given-names>E. A. M.</given-names></name> <name><surname>van Schaik</surname> <given-names>G.</given-names></name> <name><surname>Botreau</surname> <given-names>R.</given-names></name> <name><surname>Engel</surname> <given-names>B.</given-names></name> <name><surname>Dijkstra</surname> <given-names>T.</given-names></name> <etal/></person-group>. (<year>2013a</year>). <article-title>Evaluating results of the Welfare Quality multi-criteria evaluation model for classification of dairy cattle welfare at the herd level</article-title>. <source>J. Dairy Sci.</source> <volume>96</volume>, <fpage>6264</fpage>&#x02013;<lpage>6273</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2012-6129</pub-id><pub-id pub-id-type="pmid">23932136</pub-id></citation></ref>
<ref id="B28">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Vries</surname> <given-names>M.</given-names></name> <name><surname>Bokkers</surname> <given-names>E. A. M.</given-names></name> <name><surname>van Schaik</surname> <given-names>G.</given-names></name> <name><surname>Engel</surname> <given-names>B.</given-names></name> <name><surname>Dijkstra</surname> <given-names>T.</given-names></name> <name><surname>de Boer</surname> <given-names>I. J. M.</given-names></name></person-group> (<year>2014</year>). <article-title>Exploring the value of routinely collected herd data for estimating dairy cattle welfare</article-title>. <source>J. Dairy Sci.</source> <volume>97</volume>, <fpage>715</fpage>&#x02013;<lpage>730</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2013-6585</pub-id><pub-id pub-id-type="pmid">24290821</pub-id></citation></ref>
<ref id="B29">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Vries</surname> <given-names>M.</given-names></name> <name><surname>Bokkers</surname> <given-names>E. A. M.</given-names></name> <name><surname>van Schaik</surname> <given-names>G.</given-names></name> <name><surname>Engel</surname> <given-names>B.</given-names></name> <name><surname>Dijkstra</surname> <given-names>T.</given-names></name> <name><surname>de Boer</surname> <given-names>I. J. M.</given-names></name></person-group> (<year>2016</year>). <article-title>Improving the time efficiency of identifying dairy herds with poorer welfare in a population</article-title>. <source>J. Dairy Sci.</source> <volume>99</volume>, <fpage>8282</fpage>&#x02013;<lpage>8296</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2015-9979</pub-id><pub-id pub-id-type="pmid">27423954</pub-id></citation></ref>
<ref id="B30">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>de Vries</surname> <given-names>M.</given-names></name> <name><surname>Engel</surname> <given-names>B.</given-names></name> <name><surname>den Uijl</surname> <given-names>I.</given-names></name> <name><surname>van Schaik</surname> <given-names>G.</given-names></name> <name><surname>Dijkstra</surname> <given-names>T.</given-names></name> <name><surname>de Boer</surname> <given-names>I. J. M.</given-names></name> <etal/></person-group>. (<year>2013b</year>). <article-title>Assessment time of the Welfare Quality<sup>&#x000AE;</sup> protocol for dairy cattle</article-title>. <source>Anim. Welfare</source> <volume>22</volume>, <fpage>85</fpage>&#x02013;<lpage>93</lpage>. <pub-id pub-id-type="doi">10.7120/09627286.22.1.085</pub-id></citation>
</ref>
<ref id="B31">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dippel</surname> <given-names>S.</given-names></name> <name><surname>Dolezal</surname> <given-names>M.</given-names></name> <name><surname>Brenninkmeyer</surname> <given-names>C.</given-names></name> <name><surname>Brinkmann</surname> <given-names>J.</given-names></name> <name><surname>March</surname> <given-names>S.</given-names></name> <name><surname>Knierim</surname> <given-names>U.</given-names></name> <etal/></person-group>. (<year>2009</year>). <article-title>Risk factors for lameness in freestall-housed dairy cows across two breeds, farming systems, and countries</article-title>. <source>J. Dairy Sci.</source> <volume>92</volume>, <fpage>5476</fpage>&#x02013;<lpage>5486</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2009-2288</pub-id><pub-id pub-id-type="pmid">19841210</pub-id></citation></ref>
<ref id="B32">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Duncan</surname> <given-names>I. J. H.</given-names></name></person-group> (<year>2005</year>). <article-title>Science-based assessment of animal welfare: farm animals</article-title>. <source>Revue scientifique et technique Office international des epizooties</source> <volume>24</volume>, <fpage>483</fpage>&#x02013;<lpage>492</lpage>. <pub-id pub-id-type="doi">10.20506/rst.24.2.1587</pub-id><pub-id pub-id-type="pmid">16358502</pub-id></citation></ref>
<ref id="B33">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Endres</surname> <given-names>M. I.</given-names></name> <name><surname>Lobeck-Luchterhand</surname> <given-names>K. M.</given-names></name> <name><surname>Espejo</surname> <given-names>L. A.</given-names></name> <name><surname>Tucker</surname> <given-names>C. B.</given-names></name></person-group> (<year>2014</year>). <article-title>Evaluation of the sample needed to accurately estimate outcome-based measurements of dairy welfare on farm</article-title>. <source>J. Dairy Sci.</source> <volume>97</volume>, <fpage>3523</fpage>&#x02013;<lpage>3530</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2013-7464</pub-id><pub-id pub-id-type="pmid">24657083</pub-id></citation></ref>
<ref id="B34">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Espejo</surname> <given-names>L. A.</given-names></name> <name><surname>Endres</surname> <given-names>M. I.</given-names></name></person-group> (<year>2007</year>). <article-title>Herd-level risk factors for lameness in high-producing holstein cows housed in freestall barns</article-title>. <source>J. Dairy Sci.</source> <volume>90</volume>, <fpage>306</fpage>&#x02013;<lpage>314</lpage>. <pub-id pub-id-type="doi">10.3168/jds.S0022-0302(07)72631-0</pub-id><pub-id pub-id-type="pmid">17183098</pub-id></citation></ref>
<ref id="B35">
<citation citation-type="book"><person-group person-group-type="author"><collab>FAWC</collab></person-group> (<year>2005</year>). <source>Report on the Welfare Implications of Farm Assurance Schemes</source>. <publisher-loc>London</publisher-loc>: <publisher-name>Farm Animal Welfare Committee</publisher-name>.</citation>
</ref>
<ref id="B36">
<citation citation-type="book"><person-group person-group-type="author"><collab>FAWC</collab></person-group> (<year>2009</year>). <source>Farm Animal Welfare in Great Britain: Past, Present and Future</source>. <publisher-loc>London</publisher-loc>: <publisher-name>Farm Animal Welfare Committee</publisher-name>.</citation>
</ref>
<ref id="B37">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Field</surname> <given-names>A.</given-names></name></person-group> (<year>2013</year>). <source>Discovering Statistics Using IBM SPSS Statistics</source>. <publisher-loc>London</publisher-loc>: <publisher-name>Sage Publications</publisher-name>.</citation>
</ref>
<ref id="B38">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Forkman</surname> <given-names>B.</given-names></name> <name><surname>Keeling</surname> <given-names>L. J.</given-names></name></person-group> (<year>2009</year>). <article-title>&#x0201C;Assessment of animal welfare measures for dairy cattle, beef bulls and veal calves,&#x0201D;</article-title> in <source>Welfare Quality Reports No. 11</source>, eds <person-group person-group-type="editor"><name><surname>Miele</surname> <given-names>M.</given-names></name> <name><surname>Roex</surname> <given-names>J.</given-names></name></person-group> (<publisher-loc>Uppsala</publisher-loc>).</citation>
</ref>
<ref id="B39">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fraser</surname> <given-names>D.</given-names></name> <name><surname>Weary</surname> <given-names>D. M.</given-names></name> <name><surname>Pajor</surname> <given-names>E. A.</given-names></name> <name><surname>Milligan</surname> <given-names>B. N.</given-names></name></person-group> (<year>1997</year>). <article-title>A scientific conception of animal welfare that reflects ethical concerns</article-title>. <source>Anim. Welfare</source> <volume>6</volume>, <fpage>187</fpage>&#x02013;<lpage>205</lpage>.</citation>
</ref>
<ref id="B40">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Green</surname> <given-names>L. E.</given-names></name> <name><surname>Huxley</surname> <given-names>J. N.</given-names></name> <name><surname>Banks</surname> <given-names>C.</given-names></name> <name><surname>Green</surname> <given-names>M. J.</given-names></name></person-group> (<year>2014</year>). <article-title>Temporal associations between low body condition, lameness and milk yield in a UK dairy herd</article-title>. <source>Prev. Vet. Med.</source> <volume>113</volume>, <fpage>63</fpage>&#x02013;<lpage>71</lpage>. <pub-id pub-id-type="doi">10.1016/j.prevetmed.2013.10.009</pub-id><pub-id pub-id-type="pmid">24183787</pub-id></citation></ref>
<ref id="B41">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Griffiths</surname> <given-names>B. E.</given-names></name> <name><surname>Grove White</surname> <given-names>D.</given-names></name> <name><surname>Oikonomou</surname> <given-names>G.</given-names></name></person-group> (<year>2018</year>). <article-title>A cross-sectional study into the prevalence of dairy cattle lameness and associated herd-level risk factors in England and Wales</article-title>. <source>Front. Vet. Sci.</source> <volume>5</volume>:<fpage>65</fpage>. <pub-id pub-id-type="doi">10.3389/fvets.2018.00065</pub-id><pub-id pub-id-type="pmid">29675419</pub-id></citation></ref>
<ref id="B42">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Haskell</surname> <given-names>M. J.</given-names></name> <name><surname>Rennie</surname> <given-names>L. J.</given-names></name> <name><surname>Bowell</surname> <given-names>V. A.</given-names></name> <name><surname>Bell</surname> <given-names>M. J.</given-names></name> <name><surname>Lawrence</surname> <given-names>A. B.</given-names></name></person-group> (<year>2006</year>). <article-title>Housing system, milk production, and zero-grazing effects on lameness and leg injury in dairy cows</article-title>. <source>J. Dairy Sci.</source> <volume>89</volume>, <fpage>4259</fpage>&#x02013;<lpage>4266</lpage>. <pub-id pub-id-type="doi">10.3168/jds.S0022-0302(06)72472-9</pub-id><pub-id pub-id-type="pmid">17033013</pub-id></citation></ref>
<ref id="B43">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heath</surname> <given-names>C. A. E.</given-names></name> <name><surname>Browne</surname> <given-names>W. J.</given-names></name> <name><surname>Mullan</surname> <given-names>S.</given-names></name> <name><surname>Main</surname> <given-names>D. C. J.</given-names></name></person-group> (<year>2014a</year>). <article-title>Navigating the iceberg: reducing the number of parameters within the Welfare Quality<sup>&#x000AE;</sup> assessment protocol for dairy cows</article-title>. <source>Animal</source> <volume>8</volume>, <fpage>1978</fpage>&#x02013;<lpage>1986</lpage>. <pub-id pub-id-type="doi">10.1017/S1751731114002018</pub-id><pub-id pub-id-type="pmid">25159607</pub-id></citation></ref>
<ref id="B44">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heath</surname> <given-names>C. A. E.</given-names></name> <name><surname>Lin</surname> <given-names>Y.</given-names></name> <name><surname>Mullan</surname> <given-names>S.</given-names></name> <name><surname>Browne</surname> <given-names>W. J.</given-names></name> <name><surname>Main</surname> <given-names>D. C. J.</given-names></name></person-group> (<year>2014b</year>). <article-title>Implementing Welfare Quality<sup>&#x000AE;</sup> in UK assurance schemes: evaluating the challenges</article-title>. <source>Anim. Welfare</source> <volume>23</volume>, <fpage>95</fpage>&#x02013;<lpage>107</lpage>. <pub-id pub-id-type="doi">10.7120/09627286.23.1.095</pub-id></citation>
</ref>
<ref id="B45">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Heath</surname> <given-names>C. A. E.</given-names></name> <name><surname>Main</surname> <given-names>D. C. J.</given-names></name> <name><surname>Mullan</surname> <given-names>S.</given-names></name> <name><surname>Haskell</surname> <given-names>M. J.</given-names></name> <name><surname>Browne</surname> <given-names>W. J.</given-names></name></person-group> (<year>2015</year>). <article-title>Sequential sampling: a novel method in farm animal welfare assessment</article-title>. <source>Animal</source>. <volume>10</volume>, <fpage>349</fpage>&#x02013;<lpage>356</lpage>. <pub-id pub-id-type="doi">10.1017/S1751731115001536</pub-id><pub-id pub-id-type="pmid">26264118</pub-id></citation></ref>
<ref id="B46">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Honey</surname> <given-names>L.</given-names></name></person-group> (<year>2013</year>). <article-title>Assuring the welfare of food animals</article-title>. <source>Vet. Rec.</source> <volume>173</volume>, <fpage>568</fpage>&#x02013;<lpage>569</lpage>. <pub-id pub-id-type="doi">10.1136/vr.f7319</pub-id><pub-id pub-id-type="pmid">24337085</pub-id></citation></ref>
<ref id="B47">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ito</surname> <given-names>K.</given-names></name> <name><surname>Weary</surname> <given-names>D. M.</given-names></name> <name><surname>von Keyserlingk</surname> <given-names>M. A. G.</given-names></name></person-group> (<year>2009</year>). <article-title>Lying behavior: assessing within- and between-herd variation in free-stall-housed dairy cows</article-title>. <source>J. Dairy Sci.</source> <volume>92</volume>, <fpage>4412</fpage>&#x02013;<lpage>4420</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2009-2235</pub-id><pub-id pub-id-type="pmid">19700701</pub-id></citation></ref>
<ref id="B48">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Knierim</surname> <given-names>U.</given-names></name> <name><surname>Winckler</surname> <given-names>C.</given-names></name></person-group> (<year>2009</year>). <article-title>On-farm welfare assessment in cattle: validity, reliability and feasibility issues and future perspectives with special regard to the Welfare Quality<sup>&#x000AE;</sup> approach</article-title>. <source>Anim. Welfare</source> <volume>18</volume>, <fpage>451</fpage>&#x02013;<lpage>458</lpage>.</citation>
</ref>
<ref id="B49">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Krug</surname> <given-names>C.</given-names></name> <name><surname>Haskell</surname> <given-names>M. J.</given-names></name> <name><surname>Nunes</surname> <given-names>T.</given-names></name> <name><surname>Stilwell</surname> <given-names>G.</given-names></name></person-group> (<year>2015</year>). <article-title>Creating a model to detect dairy cattle farms with poor welfare using a national database</article-title>. <source>Prev. Vet. Med.</source> <volume>122</volume>, <fpage>280</fpage>&#x02013;<lpage>286</lpage>. <pub-id pub-id-type="doi">10.1016/j.prevetmed.2015.10.014</pub-id><pub-id pub-id-type="pmid">26549665</pub-id></citation></ref>
<ref id="B50">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Landis</surname> <given-names>J. T.</given-names></name> <name><surname>Koch</surname> <given-names>G. G.</given-names></name></person-group> (<year>1977</year>). <article-title>The measurement of observer agreement for categorical data</article-title>. <source>Biometrics</source> <volume>33</volume>, <fpage>159</fpage>&#x02013;<lpage>174</lpage>. <pub-id pub-id-type="doi">10.2307/2529310</pub-id><pub-id pub-id-type="pmid">843571</pub-id></citation></ref>
<ref id="B51">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Main</surname> <given-names>D. C. J.</given-names></name> <name><surname>Barker</surname> <given-names>Z. E.</given-names></name> <name><surname>Leach</surname> <given-names>K. A.</given-names></name> <name><surname>Bell</surname> <given-names>N. J.</given-names></name> <name><surname>Whay</surname> <given-names>H. R.</given-names></name> <name><surname>Browne</surname> <given-names>W. J.</given-names></name></person-group> (<year>2010</year>). <article-title>Sampling strategies for monitoring lameness in dairy cattle</article-title>. <source>J. Dairy Sci.</source> <volume>93</volume>, <fpage>1970</fpage>&#x02013;<lpage>1978</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2009-2500</pub-id><pub-id pub-id-type="pmid">20412910</pub-id></citation></ref>
<ref id="B52">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Martin</surname> <given-names>P.</given-names></name> <name><surname>Bateson</surname> <given-names>P.</given-names></name></person-group> (<year>2007</year>). <source>Measuring Behaviour: An Introductory Guide</source>. <publisher-loc>Cambridge</publisher-loc>: <publisher-name>Cambridge University Press</publisher-name>. <pub-id pub-id-type="doi">10.1017/CBO9780511810893</pub-id><pub-id pub-id-type="pmid">30886898</pub-id></citation></ref>
<ref id="B53">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mason</surname> <given-names>G.</given-names></name> <name><surname>Mendl</surname> <given-names>M.</given-names></name></person-group> (<year>1993</year>). <article-title>Why is there no simple way of measuring animal welfare?</article-title> <source>Anim. Welfare</source> <volume>2</volume>, <fpage>301</fpage>&#x02013;<lpage>319</lpage>.</citation>
</ref>
<ref id="B54">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mendl</surname> <given-names>M.</given-names></name></person-group> (<year>1991</year>). <article-title>Some problems with the concept of a cut-off point for determining when an animal&#x00027;s welfare is at risk</article-title>. <source>Appl. Anim. Behav. Sci.</source> <volume>31</volume>, <fpage>139</fpage>&#x02013;<lpage>146</lpage>. <pub-id pub-id-type="doi">10.1016/0168-1591(91)90161-P</pub-id></citation>
</ref>
<ref id="B55">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mullan</surname> <given-names>S.</given-names></name> <name><surname>Browne</surname> <given-names>W. J.</given-names></name> <name><surname>Edwards</surname> <given-names>S. A.</given-names></name> <name><surname>Butterworth</surname> <given-names>A.</given-names></name> <name><surname>Whay</surname> <given-names>H. R.</given-names></name> <name><surname>Main</surname> <given-names>D. C. J.</given-names></name></person-group> (<year>2009a</year>). <article-title>The effect of sampling strategy on the estimated prevalence of welfare outcome measures on finishing pig farms</article-title>. <source>Appl. Anim. Behav. Sci.</source> <volume>119</volume>, <fpage>39</fpage>&#x02013;<lpage>48</lpage>. <pub-id pub-id-type="doi">10.1016/j.applanim.2009.03.008</pub-id></citation>
</ref>
<ref id="B56">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mullan</surname> <given-names>S.</given-names></name> <name><surname>Edwards</surname> <given-names>S. A.</given-names></name> <name><surname>Butterworth</surname> <given-names>A.</given-names></name> <name><surname>Whay</surname> <given-names>H. R.</given-names></name> <name><surname>Main</surname> <given-names>D. C. J.</given-names></name></person-group> (<year>2009b</year>). <article-title>Interdependence of welfare outcome measures and potential confounding factors on finishing pig farms</article-title>. <source>Appl. Anim. Behav. Sci.</source> <volume>121</volume>, <fpage>25</fpage>&#x02013;<lpage>31</lpage>. <pub-id pub-id-type="doi">10.1016/j.applanim.2009.07.002</pub-id></citation>
</ref>
<ref id="B57">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>M&#x000FC;lleder</surname> <given-names>C.</given-names></name> <name><surname>Troxler</surname> <given-names>J.</given-names></name> <name><surname>Laaha</surname> <given-names>G.</given-names></name> <name><surname>Waiblinger</surname> <given-names>S.</given-names></name></person-group> (<year>2007</year>). <article-title>Can environmental variables replace some animal-based parameters in welfare assessment of dairy cows?</article-title> <source>Anim. Welfare</source> <volume>16</volume>, <fpage>153</fpage>&#x02013;<lpage>156</lpage>.</citation>
</ref>
<ref id="B58">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nicol</surname> <given-names>C.</given-names></name> <name><surname>Caplen</surname> <given-names>G.</given-names></name> <name><surname>Edgar</surname> <given-names>J.</given-names></name> <name><surname>Richards</surname> <given-names>G.</given-names></name> <name><surname>Browne</surname> <given-names>W.</given-names></name></person-group> (<year>2011</year>). <article-title>Relationships between multiple welfare indicators measured in individual chickens across different time periods and environments</article-title>. <source>Anim. Welfare</source> <volume>20</volume>, <fpage>133</fpage>&#x02013;<lpage>143</lpage>.</citation>
</ref>
<ref id="B59">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Nyman</surname> <given-names>A.-K.</given-names></name> <name><surname>Lindberg</surname> <given-names>A.</given-names></name> <name><surname>Sandgren</surname> <given-names>C. H.</given-names></name></person-group> (<year>2011</year>). <article-title>Can pre-collected register data be used to identify dairy herds with good cattle welfare?</article-title> <source>Acta Vet. Scand.</source> <volume>53</volume>, <fpage>S8</fpage>&#x02013;<lpage>S8</lpage>. <pub-id pub-id-type="doi">10.1186/1751-0147-53-S1-S8</pub-id><pub-id pub-id-type="pmid">21999569</pub-id></citation></ref>
<ref id="B60">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Presi</surname> <given-names>P.</given-names></name> <name><surname>Reist</surname> <given-names>M.</given-names></name></person-group> (<year>2011</year>). <article-title>Review of methodologies applicable to the validation of animal based indicators of welfare</article-title>. <source>EFSA Support. Publi.</source> <volume>8</volume>:<fpage>171E</fpage>. <pub-id pub-id-type="doi">10.2903/sp.efsa.2011.EN-171</pub-id></citation>
</ref>
<ref id="B61">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Proudfoot</surname> <given-names>K. L.</given-names></name> <name><surname>Weary</surname> <given-names>D. M.</given-names></name> <name><surname>von Keyserlingk</surname> <given-names>M. A. G.</given-names></name></person-group> (<year>2010</year>). <article-title>Behavior during transition differs for cows diagnosed with claw horn lesions in mid lactation</article-title>. <source>J. Dairy Sci.</source> <volume>93</volume>, <fpage>3970</fpage>&#x02013;<lpage>3978</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2009-2767</pub-id><pub-id pub-id-type="pmid">20723672</pub-id></citation></ref>
<ref id="B62">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Randall</surname> <given-names>L. V.</given-names></name> <name><surname>Green</surname> <given-names>M. J.</given-names></name> <name><surname>Chagunda</surname> <given-names>M. G. G.</given-names></name> <name><surname>Mason</surname> <given-names>C.</given-names></name> <name><surname>Archer</surname> <given-names>S. C.</given-names></name> <name><surname>Green</surname> <given-names>L. E.</given-names></name> <etal/></person-group>. (<year>2015</year>). <article-title>Low body condition predisposes cattle to lameness: an 8-year study of one dairy herd</article-title>. <source>J. Dairy Sci.</source> <volume>98</volume>, <fpage>3766</fpage>&#x02013;<lpage>3777</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2014-8863</pub-id><pub-id pub-id-type="pmid">25828666</pub-id></citation></ref>
<ref id="B63">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Randall</surname> <given-names>L. V.</given-names></name> <name><surname>Thomas</surname> <given-names>H. J.</given-names></name> <name><surname>Remnant</surname> <given-names>J. G.</given-names></name> <name><surname>Bollard</surname> <given-names>N. J.</given-names></name> <name><surname>Huxley</surname> <given-names>J. N.</given-names></name></person-group> (<year>2019</year>). <article-title>Lameness prevalence in a random sample of UK dairy herds</article-title>. <source>Vet. Rec.</source> <volume>184</volume>, <fpage>350</fpage>&#x02013;<lpage>350</lpage>. <pub-id pub-id-type="doi">10.1136/vr.105047</pub-id><pub-id pub-id-type="pmid">30824601</pub-id></citation></ref>
<ref id="B64">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Roche</surname> <given-names>J. R.</given-names></name> <name><surname>Friggens</surname> <given-names>N. C.</given-names></name> <name><surname>Kay</surname> <given-names>J. K.</given-names></name> <name><surname>Fisher</surname> <given-names>M. W.</given-names></name> <name><surname>Stafford</surname> <given-names>K. J.</given-names></name> <name><surname>Berry</surname> <given-names>D. P.</given-names></name></person-group> (<year>2009</year>). <article-title>Invited review: body condition score and its association with dairy cow productivity, health, and welfare</article-title>. <source>J. Dairy Sci.</source> <volume>92</volume>, <fpage>5769</fpage>&#x02013;<lpage>5801</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2009-2431</pub-id><pub-id pub-id-type="pmid">19923585</pub-id></citation></ref>
<ref id="B65">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rushen</surname> <given-names>J.</given-names></name> <name><surname>Chapinal</surname> <given-names>N.</given-names></name> <name><surname>de Passill&#x000E9;</surname> <given-names>A. M.</given-names></name></person-group> (<year>2012</year>). <article-title>Automated monitoring of behavioural-based animal welfare indicators</article-title>. <source>Anim. Welfare</source> <volume>21</volume>, <fpage>339</fpage>&#x02013;<lpage>350</lpage>. <pub-id pub-id-type="doi">10.7120/09627286.21.3.339</pub-id><pub-id pub-id-type="pmid">30347653</pub-id></citation></ref>
<ref id="B66">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rushen</surname> <given-names>J.</given-names></name> <name><surname>Passill&#x000E9;</surname> <given-names>A. M. B.d.</given-names></name></person-group> (<year>1992</year>). <article-title>The scientific assessment of the impact of housing on animal welfare: a critical review</article-title>. <source>Can. J. Anim. Sci.</source> <volume>72</volume>, <fpage>721</fpage>&#x02013;<lpage>743</lpage>. <pub-id pub-id-type="doi">10.4141/cjas92-085</pub-id><pub-id pub-id-type="pmid">18625583</pub-id></citation></ref>
<ref id="B67">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rutherford</surname> <given-names>K. M. D.</given-names></name> <name><surname>Langford</surname> <given-names>F. M.</given-names></name> <name><surname>Jack</surname> <given-names>M. C.</given-names></name> <name><surname>Sherwood</surname> <given-names>L.</given-names></name> <name><surname>Lawrence</surname> <given-names>A. B.</given-names></name> <name><surname>Haskell</surname> <given-names>M. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Lameness prevalence and risk factors in organic and non-organic dairy herds in the United Kingdom</article-title>. <source>Vet. J.</source> <volume>180</volume>, <fpage>95</fpage>&#x02013;<lpage>105</lpage>. <pub-id pub-id-type="doi">10.1016/j.tvjl.2008.03.015</pub-id><pub-id pub-id-type="pmid">18462961</pub-id></citation></ref>
<ref id="B68">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sandgren</surname> <given-names>C. H.</given-names></name> <name><surname>Lindberg</surname> <given-names>A.</given-names></name> <name><surname>Keeling</surname> <given-names>L. J.</given-names></name></person-group> (<year>2009</year>). <article-title>Using a national dairy database to identify herds with poor welfare</article-title>. <source>Anim. Welfare</source> <volume>18</volume>, <fpage>523</fpage>&#x02013;<lpage>532</lpage>.<pub-id pub-id-type="pmid">21999569</pub-id></citation></ref>
<ref id="B69">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schirmann</surname> <given-names>K.</given-names></name> <name><surname>Chapinal</surname> <given-names>N.</given-names></name> <name><surname>Weary</surname> <given-names>D. M.</given-names></name> <name><surname>Heuwieser</surname> <given-names>W.</given-names></name> <name><surname>von Keyserlingk</surname> <given-names>M. A. G.</given-names></name></person-group> (<year>2012</year>). <article-title>Rumination and its relationship to feeding and lying behavior in Holstein dairy cows</article-title>. <source>J. Dairy Sci.</source> <volume>95</volume>, <fpage>3212</fpage>&#x02013;<lpage>3217</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2011-4741</pub-id><pub-id pub-id-type="pmid">22612956</pub-id></citation></ref>
<ref id="B70">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schmidt</surname> <given-names>R. C.</given-names></name></person-group> (<year>1997</year>). <article-title>Managing delphi surveys using nonparametric statistical techniques</article-title>. <source>Decision Sci.</source> <volume>28</volume>, <fpage>763</fpage>&#x02013;<lpage>774</lpage>. <pub-id pub-id-type="doi">10.1111/j.1540-5915.1997.tb01330.x</pub-id></citation>
</ref>
<ref id="B71">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stangaferro</surname> <given-names>M. L.</given-names></name> <name><surname>Wijma</surname> <given-names>R.</given-names></name> <name><surname>Caixeta</surname> <given-names>L. S.</given-names></name> <name><surname>Al-Abri</surname> <given-names>M. A.</given-names></name> <name><surname>Giordano</surname> <given-names>J. O.</given-names></name></person-group> (<year>2016a</year>). <article-title>Use of rumination and activity monitoring for the identification of dairy cows with health disorders: part I. Metabolic and digestive disorders</article-title>. <source>J. Dairy Sci.</source> <volume>99</volume>, <fpage>7395</fpage>&#x02013;<lpage>7410</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2016-10907</pub-id><pub-id pub-id-type="pmid">27372591</pub-id></citation></ref>
<ref id="B72">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stangaferro</surname> <given-names>M. L.</given-names></name> <name><surname>Wijma</surname> <given-names>R.</given-names></name> <name><surname>Caixeta</surname> <given-names>L. S.</given-names></name> <name><surname>Al-Abri</surname> <given-names>M. A.</given-names></name> <name><surname>Giordano</surname> <given-names>J. O.</given-names></name></person-group> (<year>2016b</year>). <article-title>Use of rumination and activity monitoring for the identification of dairy cows with health disorders: part II. Mastitis</article-title>. <source>J. Dairy Sci.</source> <volume>99</volume>, <fpage>7411</fpage>&#x02013;<lpage>7421</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2016-10908</pub-id><pub-id pub-id-type="pmid">27372584</pub-id></citation></ref>
<ref id="B73">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Stangaferro</surname> <given-names>M. L.</given-names></name> <name><surname>Wijma</surname> <given-names>R.</given-names></name> <name><surname>Caixeta</surname> <given-names>L. S.</given-names></name> <name><surname>Al-Abri</surname> <given-names>M. A.</given-names></name> <name><surname>Giordano</surname> <given-names>J. O.</given-names></name></person-group> (<year>2016c</year>). <article-title>Use of rumination and activity monitoring for the identification of dairy cows with health disorders: part III. Metritis</article-title>. <source>J. Dairy Sci.</source> <volume>99</volume>, <fpage>7422</fpage>&#x02013;<lpage>7433</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2016-11352</pub-id><pub-id pub-id-type="pmid">27372583</pub-id></citation></ref>
<ref id="B74">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Thorup</surname> <given-names>V. M.</given-names></name> <name><surname>Nielsen</surname> <given-names>B. L.</given-names></name> <name><surname>Robert</surname> <given-names>P.-E.</given-names></name> <name><surname>Giger-Reverdin</surname> <given-names>S.</given-names></name> <name><surname>Konka</surname> <given-names>J.</given-names></name> <name><surname>Michie</surname> <given-names>C.</given-names></name> <etal/></person-group>. (<year>2016</year>). <article-title>Lameness affects cow feeding but not rumination behavior as characterized from sensor data</article-title>. <source>Front. Vet. Sci.</source> <volume>3</volume>:<fpage>37</fpage>. <pub-id pub-id-type="doi">10.3389/fvets.2016.00037</pub-id><pub-id pub-id-type="pmid">27243025</pub-id></citation></ref>
<ref id="B75">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Os</surname> <given-names>J. M. C.</given-names></name> <name><surname>Winckler</surname> <given-names>C.</given-names></name> <name><surname>Trieb</surname> <given-names>J.</given-names></name> <name><surname>Matarazzo</surname> <given-names>S. V.</given-names></name> <name><surname>Lehenbauer</surname> <given-names>T. W.</given-names></name> <name><surname>Champagne</surname> <given-names>J. D.</given-names></name> <etal/></person-group>. (<year>2018</year>). <article-title>Reliability of sampling strategies for measuring dairy cattle welfare on commercial farms</article-title>. <source>J. Dairy Sci.</source> <volume>101</volume>, <fpage>1495</fpage>&#x02013;<lpage>1504</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2017-13611</pub-id><pub-id pub-id-type="pmid">29248223</pub-id></citation></ref>
<ref id="B76">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Van Reenen</surname> <given-names>C. G.</given-names></name> <name><surname>O&#x00027;Connell</surname> <given-names>N. E.</given-names></name> <name><surname>Van der Werf</surname> <given-names>J. T. N.</given-names></name> <name><surname>Korte</surname> <given-names>S. M.</given-names></name> <name><surname>Hopster</surname> <given-names>H.</given-names></name> <name><surname>Jones</surname> <given-names>R. B.</given-names></name> <etal/></person-group>. (<year>2005</year>). <article-title>Responses of calves to acute stress: Individual consistency and relations between behavioral and physiological measures</article-title>. <source>Physiol. Behav.</source> <volume>85</volume>, <fpage>557</fpage>&#x02013;<lpage>570</lpage>. <pub-id pub-id-type="doi">10.1016/j.physbeh.2005.06.015</pub-id><pub-id pub-id-type="pmid">16081113</pub-id></citation></ref>
<ref id="B77">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>van Staaveren</surname> <given-names>N.</given-names></name> <name><surname>Doyle</surname> <given-names>B.</given-names></name> <name><surname>Manzanilla</surname> <given-names>E. G.</given-names></name> <name><surname>Calder&#x000F3;n D&#x000ED;az</surname> <given-names>J. A.</given-names></name> <name><surname>Hanlon</surname> <given-names>A.</given-names></name> <name><surname>Boyle</surname> <given-names>L. A.</given-names></name></person-group> (<year>2017</year>). <article-title>Validation of carcass lesions as indicators for on-farm health and welfare of pigs</article-title>. <source>J. Anim. Sci.</source> <volume>95</volume>, <fpage>1528</fpage>&#x02013;<lpage>1536</lpage>. <pub-id pub-id-type="doi">10.2527/jas2016.1180</pub-id><pub-id pub-id-type="pmid">28464078</pub-id></citation></ref>
<ref id="B78">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vasseur</surname> <given-names>E.</given-names></name> <name><surname>Rushen</surname> <given-names>J.</given-names></name> <name><surname>Haley</surname> <given-names>D. B.</given-names></name> <name><surname>de Passill&#x000E9;</surname> <given-names>A. M.</given-names></name></person-group> (<year>2012</year>). <article-title>Sampling cows to assess lying time for on-farm animal welfare assessment</article-title>. <source>J. Dairy Sci.</source> <volume>95</volume>, <fpage>4968</fpage>&#x02013;<lpage>4977</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2011-5176</pub-id><pub-id pub-id-type="pmid">22916901</pub-id></citation></ref>
<ref id="B79">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Veissier</surname> <given-names>I.</given-names></name> <name><surname>Capdeville</surname> <given-names>J.</given-names></name> <name><surname>Delval</surname> <given-names>E.</given-names></name></person-group> (<year>2004</year>). <article-title>Cubicle housing systems for cattle: comfort of dairy cows depends on cubicle adjustment</article-title>. <source>J. Anim. Sci.</source> <volume>82</volume>, <fpage>3321</fpage>&#x02013;<lpage>3337</lpage>. <pub-id pub-id-type="doi">10.2527/2004.82113321x</pub-id><pub-id pub-id-type="pmid">15542480</pub-id></citation></ref>
<ref id="B80">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Waiblinger</surname> <given-names>S.</given-names></name> <name><surname>Knierim</surname> <given-names>U.</given-names></name> <name><surname>Winckler</surname> <given-names>C.</given-names></name></person-group> (<year>2001</year>). <article-title>The development of an epidemiologically based on-farm welfare assessment system for use with dairy cows</article-title>. <source>Acta Agri. Scand. A Anim. Sci.</source> <volume>51</volume>, <fpage>73</fpage>&#x02013;<lpage>77</lpage>. <pub-id pub-id-type="doi">10.1080/090647001316923108</pub-id></citation>
</ref>
<ref id="B81">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Walker</surname> <given-names>S. L.</given-names></name> <name><surname>Smith</surname> <given-names>R. F.</given-names></name> <name><surname>Routly</surname> <given-names>J. E.</given-names></name> <name><surname>Jones</surname> <given-names>D. N.</given-names></name> <name><surname>Morris</surname> <given-names>M. J.</given-names></name> <name><surname>Dobson</surname> <given-names>H.</given-names></name></person-group> (<year>2008</year>). <article-title>Lameness, activity time-budgets, and estrus expression in dairy cattle</article-title>. <source>J. Dairy Sci.</source> <volume>91</volume>, <fpage>4552</fpage>&#x02013;<lpage>4559</lpage>. <pub-id pub-id-type="doi">10.3168/jds.2008-1048</pub-id><pub-id pub-id-type="pmid">19038930</pub-id></citation></ref>
<ref id="B82">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Weary</surname> <given-names>D. M.</given-names></name> <name><surname>Huzzey</surname> <given-names>J. M.</given-names></name> <name><surname>von Keyserlingk</surname> <given-names>M. A. G.</given-names></name></person-group> (<year>2009</year>). <article-title>Board-Invited Review: using behavior to predict and identify ill health in animals</article-title>. <source>J. Anim. Sci.</source> <volume>87</volume>, <fpage>770</fpage>&#x02013;<lpage>777</lpage>. <pub-id pub-id-type="doi">10.2527/jas.2008-1297</pub-id><pub-id pub-id-type="pmid">18952731</pub-id></citation></ref>
<ref id="B83">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Webster</surname> <given-names>A. J. F.</given-names></name> <name><surname>Main</surname> <given-names>D. C. J.</given-names></name> <name><surname>Whay</surname> <given-names>H. R.</given-names></name></person-group> (<year>2004</year>). <article-title>Welfare assessment: indices from clinical observation</article-title>. <source>Anim. Welfare</source> <volume>13</volume>, <fpage>93</fpage>&#x02013;<lpage>98</lpage>.<pub-id pub-id-type="pmid">16638783</pub-id></citation></ref>
<ref id="B84">
<citation citation-type="book"><person-group person-group-type="author"><collab>Welfare Quality</collab></person-group> (<year>2009</year>). <source>Welfare Quality Assessment Protocol for Cattle</source> (<publisher-loc>Leystad</publisher-loc>: <publisher-name>W.Q. Consortium</publisher-name>).</citation>
</ref>
<ref id="B85">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wemelsfelder</surname> <given-names>F.</given-names></name></person-group> (<year>2007</year>). <article-title>How animals communicate quality of life: the qualitative assessment of behaviour</article-title>. <source>Anim. Welfare</source> <volume>16</volume>, <fpage>25</fpage>&#x02013;<lpage>31</lpage>.</citation>
</ref>
<ref id="B86">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wemelsfelder</surname> <given-names>F.</given-names></name> <name><surname>Hunter</surname> <given-names>E. A.</given-names></name> <name><surname>Mendl</surname> <given-names>M. T.</given-names></name> <name><surname>Lawrence</surname> <given-names>A. B.</given-names></name></person-group> (<year>2000</year>). <article-title>The spontaneous qualitative assessment of behavioural expressions in pigs: first explorations of a novel methodology for integrative animal welfare measurement</article-title>. <source>Appl. Anim. Behav. Sci.</source> <volume>67</volume>, <fpage>193</fpage>&#x02013;<lpage>215</lpage>. <pub-id pub-id-type="doi">10.1016/S0168-1591(99)00093-3</pub-id><pub-id pub-id-type="pmid">10736529</pub-id></citation></ref>
<ref id="B87">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wemelsfelder</surname> <given-names>F.</given-names></name> <name><surname>Hunter</surname> <given-names>T. E. A.</given-names></name> <name><surname>Mendl</surname> <given-names>M. T.</given-names></name> <name><surname>Lawrence</surname> <given-names>A. B.</given-names></name></person-group> (<year>2001</year>). <article-title>Assessing the &#x02018;whole animal&#x02019;: a free choice profiling approach</article-title>. <source>Anim. Behav.</source> <volume>62</volume>, <fpage>209</fpage>&#x02013;<lpage>220</lpage>. <pub-id pub-id-type="doi">10.1006/anbe.2001.1741</pub-id><pub-id pub-id-type="pmid">22745187</pub-id></citation></ref>
<ref id="B88">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Whay</surname> <given-names>H. R.</given-names></name> <name><surname>Main</surname> <given-names>D. C. J.</given-names></name> <name><surname>Green</surname> <given-names>L. E.</given-names></name> <name><surname>Webster</surname> <given-names>A. J. F.</given-names></name></person-group> (<year>2003a</year>). <article-title>Animal-based measures for the assessment of welfare state of dairy cattle, pigs and laying hens: consensus of expert opinion</article-title>. <source>Anim. Welfare</source> <volume>12</volume>, <fpage>205</fpage>&#x02013;<lpage>217</lpage>.</citation>
</ref>
<ref id="B89">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Whay</surname> <given-names>H. R.</given-names></name> <name><surname>Main</surname> <given-names>D. C. J.</given-names></name> <name><surname>Green</surname> <given-names>L. E.</given-names></name> <name><surname>Webster</surname> <given-names>A. J. F.</given-names></name></person-group> (<year>2003b</year>). <article-title>Assessment of the welfare of dairy cattle using animal-based measurements: direct observations and investigation of farm records</article-title>. <source>Vet. Rec.</source> <volume>153</volume>, <fpage>197</fpage>&#x02013;<lpage>202</lpage>. <pub-id pub-id-type="doi">10.1136/vr.153.7.197</pub-id><pub-id pub-id-type="pmid">12956296</pub-id></citation></ref>
<ref id="B90">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Whay</surname> <given-names>H. R.</given-names></name> <name><surname>Webster</surname> <given-names>A. J. F.</given-names></name> <name><surname>Waterman-Pearson</surname> <given-names>A. E.</given-names></name></person-group> (<year>2005</year>). <article-title>Role of ketoprofen in the modulation of hyperalgesia associated with lameness in dairy cattle</article-title>. <source>Vet. Rec.</source> <volume>157</volume>, <fpage>729</fpage>&#x02013;<lpage>733</lpage>. <pub-id pub-id-type="doi">10.1136/vr.157.23.729</pub-id><pub-id pub-id-type="pmid">16326965</pub-id></citation></ref>
<ref id="B91">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Wiech</surname> <given-names>K.</given-names></name> <name><surname>Tracey</surname> <given-names>I.</given-names></name></person-group> (<year>2009</year>). <article-title>The influence of negative emotions on pain: behavioral effects and neural mechanisms</article-title>. <source>NeuroImage</source> <volume>47</volume>, <fpage>987</fpage>&#x02013;<lpage>994</lpage>. <pub-id pub-id-type="doi">10.1016/j.neuroimage.2009.05.059</pub-id><pub-id pub-id-type="pmid">19481610</pub-id></citation></ref>
<ref id="B92">
<citation citation-type="book"><person-group person-group-type="author"><name><surname>Winckler</surname> <given-names>C.</given-names></name></person-group> (<year>2014</year>). <article-title>&#x0201C;Inter-observer agreement for qualitative behaviour assessment in dairy cattle in three different countries,&#x0201D;</article-title> in <source>WAFL 2014 : Proceedings of the 6th International Conference on the Assessment of Animal Welfare at Farm and Group Level</source>, eds <person-group person-group-type="editor"><name><surname>Mounier</surname> <given-names>L.</given-names></name> <name><surname>Veissier</surname> <given-names>I.</given-names></name></person-group>. (<publisher-loc>Wageningen</publisher-loc>: <publisher-name>Wageningen Academic Publishers</publisher-name>), <fpage>181</fpage>.</citation>
</ref>
<ref id="B93">
<citation citation-type="journal"><person-group person-group-type="author"><name><surname>Winckler</surname> <given-names>C.</given-names></name> <name><surname>Capdeville</surname> <given-names>J.</given-names></name> <name><surname>Gebresenbet</surname> <given-names>G.</given-names></name> <name><surname>H&#x000F6;rning</surname> <given-names>B.</given-names></name> <name><surname>Roiha</surname> <given-names>U.</given-names></name> <name><surname>Tosi</surname> <given-names>M.</given-names></name> <etal/></person-group>. (<year>2003</year>). <article-title>Selection of parameters for on-farm welfare-assessment protocols in cattle and buffalo</article-title>. <source>Anim. Welfare</source> <volume>12</volume>, <fpage>619</fpage>&#x02013;<lpage>624</lpage>.</citation>
</ref>
</ref-list> 
</back>
</article>