Accuracy of genome-enabled polygenic risk score prediction of cruciate ligament rupture risk in the Labrador Retriever

Miranda, Benjamin; Momen, Mehdi; Sample, Susannah J.; Muir, Peter

doi:10.3389/fvets.2025.1625953

ORIGINAL RESEARCH article

Front. Vet. Sci., 26 August 2025

Sec. Comparative and Clinical Medicine

Volume 12 - 2025 | https://doi.org/10.3389/fvets.2025.1625953

Benjamin Miranda

Mehdi Momen^*

Susannah J. Sample

Peter Muir^*

Comparative Orthopaedic & Genetics Research Laboratory, Department of Surgical Science, School of Veterinary Medicine, University of Wisconsin-Madison, Madison, WI, United States

Introduction: Canine cruciate ligament rupture (CR) is a common, complex, polygenic, orthopaedic disease in dogs that results in serious financial burden and patient morbidity even in the face of surgical correction. The goal of this study was to evaluate the clinical utility of CR polygenic risk score (PRS) prediction models using genome-wide SNP data from a large reference population of Labrador Retriever dogs.

Methods: Using 10-fold cross-validation and an independent validation population, we assessed Bayesian and machine learning models with and without covariates using both genome-wide SNPs as well as genic SNPs. Models were tuned by optimizing numbers of CR risk SNPs selected by genome-wide association and adjusting posterior probability thresholds to maximize prediction accuracy.

Results: Models that included clinical covariates (sex, neuter status, age, weight, withers height, as well as the first 10 principal components from the genetic relationship matrix) generally yielded higher accuracy, up to 88.5% compared with up to 77% without covariates. Prediction accuracy for some models was reduced when only genic SNPs were used, suggesting SNPs in non-coding regions could influence CR disease risk.

Discussion: Our results confirm that PRS models provide sufficient predictive accuracy for clinical application in veterinary medicine and offer a viable, early-life screening tool for personalized care and selective breeding to reduce CR incidence in high-risk breeds. Our results further confirm that CR is a complex polygenic disease in which genome-wide risk SNPs influence disease pathogenesis.

Introduction

Canine cruciate ligament rupture (CR) is one of the most common orthopaedic diseases encountered in veterinary medicine (1). The disease often results in serious long-term sequelae such as reduced mobility from osteoarthritis, even with surgical stabilization of the stifle, because osteoarthritis is typically established at diagnosis (2). With a high rate of contralateral rupture, CR results in high patient morbidity and a high economic burden to owners (3, 4). CR is a complex polygenic disease in which both environmental and genetic risk factors contribute to disease progression (5). Contributing factors include breed predisposition (6), ligament matrix degeneration (7), obesity (8), conformation (8), and joint immune responses (7). In addition, ligament rupture is usually a consequence of complex pathogenesis in which polygenic effects on various physiological pathways affect cruciate ligament homeostasis in different ways. These effects promote fatigue injury to collagen fibers, with progressive fiber rupture in the presence of synovitis, as the cause of the majority of non-contact CR, rather than a single-cycle mechanical overload of the cranial cruciate ligament (5). The concept that CR is a heritable disease rather than an injury aligns with a growing body of evidence in the human literature regarding non-contact ACL rupture (5, 9, 10).

The prevalence of CR is breed dependent with heritability estimates ranging from 0.27–0.85 in dogs (11–14). Breeds with high prevalence, such as the Labrador Retriever, Rottweiler, and Newfoundland, have a concentration of risk loci because of breed selection (15, 16). Genomic studies in dogs have shown CR is highly polygenic in the Labrador Retriever (5, 11). Genome-wide association studies have identified few large effect and numerous small effect genetic variants suggesting CR is primarily a polygenic disease (5, 17). Current heritability estimates in the Labrador Retriever (0.52–0.63) suggest CR is a disease with moderate to high heritability (5). Studies of the genetic architecture of CR in the Labrador Retriever have also shown that risk of CR is influenced by coat color (18). Many risk genes are also shared with human ACL rupture (5).

For complex heritable diseases, polygenic risk score (PRS) prediction enables quantification of an individual's risk by assuming all single nucleotide polymorphisms (SNPs) are disease-associated risk variants even if their effect is very small (11, 19). These variants in combination influence disease risk and can be analyzed by risk models to estimate the probability of an individual developing the disease over their lifetime (20). A PRS value therefore represents the heritable risk of developing a disease in an individual based on the total number of significant genetic variants they have (21). PRS prediction is widely used in the study of human complex polygenic disease and is increasingly studied in companion animals in veterinary medicine (5, 21).
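As a generic illustration of the additive idea behind a PRS, the sketch below sums per-SNP effect sizes weighted by allele counts. The function name and the toy effect sizes are hypothetical; the models evaluated in this study output posterior probabilities from Bayesian and machine-learning fits rather than this simple weighted sum.

```python
def polygenic_risk_score(allele_counts, effect_sizes):
    """Additive score: sum over SNPs of effect size times risk-allele count.

    allele_counts: 0/1/2 copies of the risk allele at each SNP (one dog).
    effect_sizes:  hypothetical per-SNP effects, in the same SNP order.
    """
    if len(allele_counts) != len(effect_sizes):
        raise ValueError("allele_counts and effect_sizes must align")
    return sum(beta * count for beta, count in zip(effect_sizes, allele_counts))


# Toy example with three SNPs (values are illustrative only).
toy_score = polygenic_risk_score([0, 1, 2], [0.5, -0.25, 0.125])
```

Many small effects can combine into a sizeable score even when no single SNP is large, which is the motivation for using genome-wide SNP sets.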

In the current study, our goal was to validate PRS prediction of the risk of CR in the Labrador Retriever, as the Labrador is one of the high-risk breeds with an increased prevalence above the general population at 5.79% (6). Our previous research has generated a large reference population of Labrador Retrievers accurately phenotyped as CR cases or controls, enabling definitive estimates of heritability, genetic architecture, and initial PRS prediction using cross validation in this reference population (5). The purpose of the present study was to continue clinical development of PRS prediction of risk of CR in the Labrador Retriever by using a new validation population to confirm PRS prediction has sufficient accuracy for clinical use (5, 20).

Materials and methods

Data collection and phenotyping

Client-owned Labrador Retriever dogs were recruited for the validation group at the University of Wisconsin-Madison School of Veterinary Medicine through online advertising and local and national breed clubs. All procedures were performed in accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the National Institutes of Health and the American Veterinary Medical Association, and with IACUC approval (V5463). All owners gave informed consent. Purebred status was confirmed from a pedigree for each dog. Relatedness between individuals was screened via pedigree review and siblings were excluded to reduce Type I error rates. Dogs were phenotyped by orthopaedic exam and lateromedial stifle radiographs. Dogs were considered a case if they had CR diagnosed by a veterinarian, with most cases having their CR confirmed during surgical stifle stabilization. Labrador Retriever dogs were classified as a control if they were >8 years of age, had both stifles palpated as stable by a veterinarian, and had no evidence of stifle effusion or osteophytosis on stifle radiographs that would be indicative of CR (22). This age threshold was chosen because Labrador Retrievers ≥8 years have an ~6% chance of experiencing CR (23). Age, weight, withers height, sex, neuter status, and coat color were also recorded. If a control dog subsequently developed CR, the phenotype was updated.

Sample populations and SNP genotyping quality control

DNA was obtained from blood or saliva samples. SNP genotyping was performed using the Illumina CanineHD BeadChip containing ~230,000 SNPs across the canine genome (CanFam3.1). The reference or training population (TRN) group of Labrador Retriever dogs contained 1,006 dogs (440 cases, 556 controls). This meta dataset combined two sources: the first comprised 719 Labrador Retrievers recruited at UW-Madison (326 cases, 383 controls) and the second comprised 287 Labrador Retrievers provided by Cornell University (114 cases, 173 controls) (5). The covariate phenotypes were not available for the dataset from Cornell University. The validation or testing group (TST) of Labrador Retrievers consisted of 52 dogs (24 cases, 28 controls). Within cases in the TST group, there were 8 neutered males, 3 intact males, 12 ovariohysterectomized females, and 1 intact female. Within controls, there were 7 neutered males, 7 intact males, 13 ovariohysterectomized females, and 1 intact female.

Quality control filtering of genotypic data was performed using PLINK v1.9 software (24). Samples with a genotyping call rate below 95% were excluded. SNPs were removed from the dataset if they had a minor allele frequency (MAF) < 0.01, had a genotyping call rate ≤ 95%, or if they deviated from Hardy-Weinberg proportions at a P < 1E-06. Missing genotypes were imputed using Beagle 5.4 software (25, 26). SNP data quality control resulted in 142,071 SNPs remaining.
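The SNP-level filters described above can be illustrated with a minimal sketch. This is not PLINK; it is a plain-Python approximation that assumes genotypes are coded as 0/1/2 allele counts with `None` for missing calls, and it covers only the MAF and call-rate filters (the Hardy-Weinberg test and Beagle imputation are omitted). All function names are hypothetical.

```python
def call_rate(genotypes):
    """Fraction of non-missing genotype calls for one SNP."""
    return sum(g is not None for g in genotypes) / len(genotypes)


def minor_allele_frequency(genotypes):
    """MAF from 0/1/2 allele counts, ignoring missing calls."""
    called = [g for g in genotypes if g is not None]
    if not called:
        return 0.0
    freq = sum(called) / (2 * len(called))
    return min(freq, 1 - freq)


def passes_snp_qc(genotypes, min_maf=0.01, min_call_rate=0.95):
    """Keep a SNP only if MAF >= 0.01 and call rate > 95%.

    Thresholds follow the text: SNPs are removed when MAF < 0.01
    or when the call rate is <= 95%.
    """
    return (call_rate(genotypes) > min_call_rate
            and minor_allele_frequency(genotypes) >= min_maf)


# Toy SNP genotyped in 100 dogs: 10 heterozygotes, no missing calls.
keep = passes_snp_qc([0] * 90 + [1] * 10)
```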

Experimental design for polygenic risk score prediction

The bioinformatics approach for our analysis is summarized in Figure 1. The TRN group of 1,006 Labrador Retriever dogs was used for model fitting. The TST group of 52 Labrador Retriever dogs was used as an independent validation sample. Each dog in the TST group had a predicted phenotype from their PRS value and a true CR case and control phenotype. Eight statistical models composed of four Bayesian regression models and four machine-learning models were used to estimate the predicted phenotype. The Bayesian models were Bayesian Ridge Regression (BRR), Bayesian Lasso (BL), Bayes B (BB), and Bayes C (BC) (5, 20). All the Bayesian models were fitted using the BGLR package (27). The four machine-learning models were Least Absolute Shrinkage and Selection Operator (LASSO), Support Vector Machine (SVM), Random Forest (RF), and Elastic Net (EN) (28). EN and LASSO were fitted using the glmnet function from the glmnet R-package (29). RF was implemented using the R-package “wsrf” (30) and SVM used the e1071 package (31).

Figure 1

Figure 1. Flowchart illustrating the workflow for polygenic risk score prediction and threshold tuning of case-controls for cruciate ligament rupture in the Labrador Retriever. Data from 1,064 dogs were used as the reference training set (n = 1,006) and SNP effects were tested on an independent validation set (n = 52). All samples were genotyped by Canine HD BeadChip and quality controlled. Genome-wide selection (GWS) using a mixed-model (MM) approach was applied, followed by 10-fold cross-validation (CV) for top risk SNP selection. Predictive modeling employed machine learning classifiers, Bayesian probit regression models, and ensemble logistic regression, with threshold tuning finalized using the validation dataset. Ten-fold cross validation was then rerun after optimization of risk SNP selection and threshold tuning.

Ten-fold cross validation

Initially, 10-fold cross validation was performed using the reference TRN group. The data were randomly partitioned into 10 folds, with nine of the folds used for model training and the 10th used as a test set. Each fold served as the test set in turn until all 10 folds had been evaluated. The partition scheme was similar to that in Baker et al. (20). The advantage of multiple-fold cross validation is that it allows the training dataset to remain large without sacrificing a portion of the dataset for testing. The predictions were aggregated from the 10 folds and averaged across the runs.
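The partitioning step can be sketched as follows. This is an illustrative scheme under stated assumptions (a seeded shuffle followed by round-robin assignment to folds); the exact partition used in the study follows Baker et al. (20) and may differ. Function names are hypothetical.

```python
import random


def k_fold_indices(n_samples, k=10, seed=0):
    """Randomly assign sample indices to k roughly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    # Round-robin slicing gives fold sizes that differ by at most one.
    return [indices[i::k] for i in range(k)]


def train_test_splits(n_samples, k=10, seed=0):
    """Yield (train, test) index lists; each fold is the test set once."""
    folds = k_fold_indices(n_samples, k, seed)
    for i, test in enumerate(folds):
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


# With 1,006 dogs, each test fold holds 100 or 101 dogs.
fold_sizes = [len(test) for _, test in train_test_splits(1006)]
```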

Prediction performance for each model was assessed using accuracy (ACC) and the area under the receiver operating characteristic (ROC) curve (AUC). After obtaining a posterior probability for all folds, we computed the prediction accuracy metrics. The clinical covariates included in our cross-validation analysis were sex, neuter status, weight, age, and withers height, as well as 10 principal components (PCs) from the genetic relationship matrix using all dogs. We also computed a genomic relationship matrix (GRM) and verified that the average genomic relationship between the reference (TRN) and validation (TST) populations was close to zero (Mean = −0.01, SD = 0.038). ACC values were also calculated using the tuned posterior probability thresholds (see below).
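For readers unfamiliar with AUC, the sketch below computes it from its rank-based definition: the probability that a randomly chosen case receives a higher predicted probability than a randomly chosen control, with ties counted as one half. This is an illustrative stand-in, not the pROC implementation used in the study; the function name is hypothetical.

```python
def roc_auc(y_true, scores):
    """AUC as the fraction of case-control pairs ranked correctly.

    y_true: 1 for CR case, 0 for control.
    scores: predicted probabilities, same order as y_true.
    """
    cases = [s for y, s in zip(y_true, scores) if y == 1]
    controls = [s for y, s in zip(y_true, scores) if y == 0]
    if not cases or not controls:
        raise ValueError("AUC needs at least one case and one control")
    wins = 0.0
    for c in cases:
        for n in controls:
            if c > n:
                wins += 1.0
            elif c == n:
                wins += 0.5  # ties count as half a correct ranking
    return wins / (len(cases) * len(controls))


# Toy example: three of the four case-control pairs are ranked correctly.
toy_auc = roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
```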

Optimization of risk SNP selection for each statistical model

Because each statistical model has a different analytical approach, different models often perform optimally with differing numbers of CR risk SNPs. Risk SNPs were selected based on strength of association with CR by genome-wide association study (GWAS) using a threshold mixed model for a binary trait (32). Cross-validation models were run with different numbers of risk SNPs to determine the optimum performance.

Optimization of the posterior probability threshold for distinguishing cases and controls

After obtaining the posterior probability for each individual dog in the validation TST group using the eight models, we determined the optimum threshold that maximized CR risk prediction accuracy and best distinguished cases from controls. Youden's J statistic (P^*), ACC, and geometric mean (gMean) were used as metrics for threshold optimization (33). The Youden's J statistic was computed as $P^{*} = |FPR + TPR - 1|$, where FPR is the false positive rate and TPR is the true positive rate. FPR represents the proportion of true negatives incorrectly predicted as positive: $FPR = FP/(FP + TN)$, where FP (False Positive) are the positive results incorrectly predicted and TN (True Negative) are the negative results correctly predicted. TPR represents the proportion of true positives correctly predicted and is also known as Sensitivity: $TPR = TP/(TP + FN)$, where TP (True Positive) are the positive results correctly predicted and FN (False Negative) are the negative results incorrectly predicted. The gMean, or geometric mean, a metric that is particularly valuable in binary classification tasks, focuses on the balance between the TPR (Sensitivity) and the TNR (Specificity): gMean = $\sqrt{TPR \times TNR}$ (34). TNR is the true negative rate, also known as Specificity, and represents the proportion of true negatives correctly predicted, so that $TNR = 1 - FPR$. Furthermore, the predictive accuracy of models (ACC), defined as $ACC = \frac{TP + TN}{TP + TN + FP + FN}$, was evaluated for all thresholds. A grid search was then performed over posterior probabilities from 0 to 1 at 0.025 intervals to find the most reliable cut-off threshold for distinguishing CR cases from controls according to the posterior probability.
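The three tuning metrics and the grid search can be sketched in a few lines. The code below is a minimal illustration, not the authors' implementation. It assumes a dog is called a case when its posterior probability is at or above the threshold, and it assumes a selection rule that ranks thresholds by ACC, then gMean, then lowest P*; both choices, and all function names, are assumptions made for the example.

```python
from math import sqrt


def threshold_metrics(y_true, probs, threshold):
    """Return ACC, gMean and P* = |FPR + TPR - 1| at one threshold."""
    tp = fp = tn = fn = 0
    for y, p in zip(y_true, probs):
        predicted_case = p >= threshold  # assumed decision rule
        if predicted_case and y == 1:
            tp += 1
        elif predicted_case and y == 0:
            fp += 1
        elif y == 0:
            tn += 1
        else:
            fn += 1
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one case and one control")
    tpr = tp / (tp + fn)
    fpr = fp / (fp + tn)
    tnr = tn / (tn + fp)
    return {
        "threshold": threshold,
        "acc": (tp + tn) / (tp + tn + fp + fn),
        "gmean": sqrt(tpr * tnr),
        "p_star": abs(fpr + tpr - 1),
    }


def best_threshold(y_true, probs, steps=40):
    """Grid search from 0 to 1 in 1/steps increments (0.025 by default)."""
    grid = [i / steps for i in range(steps + 1)]
    results = [threshold_metrics(y_true, probs, t) for t in grid]
    # Assumed ranking: highest ACC, then highest gMean, then lowest P*.
    return max(results, key=lambda m: (m["acc"], m["gmean"], -m["p_star"]))


# Toy data: two controls and two cases that are perfectly separable.
best = best_threshold([0, 0, 1, 1], [0.1, 0.3, 0.7, 0.9])
```

When several thresholds tie on all three metrics, this sketch returns the lowest one; a different tie-breaking rule would be equally defensible.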

Genic ontology and CR prediction according to genic region

A gene list was collected by referencing all genes related to CR from previous publications as summarized in Table 3 in Baker et al. (17). A pathway and gene ontology (GO) analysis was then performed using the most significant GO terms or wiki pathways associated with CR to verify each gene's relationship with the biology of CR. The GO is a database compiling the biological background of genes and gene products across species. Next, all genetic variants located within the gene list's genic regions without flanking regions were extracted for prediction purposes. Genic regions were identified for each gene using the UCSC Genome Browser with the transcription and coding start and stop coordinates, respectively, used to define each gene location. The TRN group of 1,006 Labrador Retriever dogs was used for model training and the TST group of Labrador Retriever dogs for validation. The eight previously used models were used to measure all predictive performance metrics.
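Extraction of SNPs that fall inside genic regions without flanking sequence amounts to an interval filter on chromosome and position. The sketch below shows one way to express it, assuming hypothetical tuple layouts for SNPs and genes and treating the start and end coordinates as inclusive; the coordinates in the example are invented for illustration.

```python
def genic_snps(snps, genes):
    """Return IDs of SNPs located inside any listed gene.

    snps:  iterable of (snp_id, chromosome, position)
    genes: iterable of (gene_name, chromosome, start, end), no flanks added
    """
    regions = {}
    for _name, chrom, start, end in genes:
        regions.setdefault(chrom, []).append((start, end))
    return [
        snp_id
        for snp_id, chrom, pos in snps
        if any(start <= pos <= end for start, end in regions.get(chrom, []))
    ]


# Hypothetical gene and SNP coordinates.
example_genes = [("GENE_A", "1", 100, 200)]
example_snps = [("s1", "1", 150), ("s2", "1", 250),
                ("s3", "2", 150), ("s4", "1", 200)]
kept = genic_snps(example_snps, example_genes)
```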

Assessment of CR prediction accuracy

A posterior probability from each model was generated for each dog in the TST group. Ensembles of the Bayesian models or machine learning models were also used to generate an average posterior probability for individual dog risk prediction as a CR case or control. This predicted phenotype was then compared to the true phenotype of each dog. AUC and three different coefficient of determination (R²) metrics were calculated to assess the predictive performance of different model scenarios. AUC was calculated using the pROC R package (34–38). The three R² values were Cox and Snell's R² ( $R_{C\&S}^{2}$ ) (39–41), Efron's R² ( $R_{ef}^{2}$ ) (42), and Nagelkerke's R² ( $R_{nag}^{2}$ ) (43). We evaluated all models with and without covariates and considered either whole genome SNPs or genic only SNPs. The covariates we considered were sex, neuter status, weight, age, and withers height as well as 10 principal components from the genetic relationship matrix. The TRN group of Labrador Retrievers was used as the training set and the TST group of dogs for validation.
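The three pseudo-R² measures can be illustrated from their standard textbook definitions. The sketch below assumes a Bernoulli log-likelihood evaluated on the predicted probabilities and a null model that predicts the observed case fraction for every dog; how the study computed these quantities is not detailed in the text, so this is an assumption, and the function names are hypothetical.

```python
from math import exp, log


def bernoulli_log_likelihood(y_true, probs, eps=1e-12):
    """Log-likelihood of 0/1 outcomes given predicted probabilities."""
    total = 0.0
    for y, p in zip(y_true, probs):
        p = min(max(p, eps), 1 - eps)  # guard against log(0)
        total += y * log(p) + (1 - y) * log(1 - p)
    return total


def pseudo_r2(y_true, probs):
    """Cox and Snell, Nagelkerke and Efron R-squared for binary outcomes."""
    n = len(y_true)
    base_rate = sum(y_true) / n
    ll_model = bernoulli_log_likelihood(y_true, probs)
    ll_null = bernoulli_log_likelihood(y_true, [base_rate] * n)
    # Cox and Snell: 1 - (L_null / L_model)^(2/n)
    cox_snell = 1 - exp(2 * (ll_null - ll_model) / n)
    # Nagelkerke rescales Cox and Snell by its maximum attainable value.
    nagelkerke = cox_snell / (1 - exp(2 * ll_null / n))
    # Efron: 1 - residual sum of squares / total sum of squares.
    ss_res = sum((y - p) ** 2 for y, p in zip(y_true, probs))
    ss_tot = sum((y - base_rate) ** 2 for y in y_true)
    efron = 1 - ss_res / ss_tot
    return {"cox_snell": cox_snell, "nagelkerke": nagelkerke, "efron": efron}


# Toy example: confident, correct predictions for two controls and two cases.
r2 = pseudo_r2([0, 1, 0, 1], [0.1, 0.9, 0.1, 0.9])
```

Because the Cox and Snell value cannot reach 1 for binary outcomes, the Nagelkerke value is always at least as large for the same predictions.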

Results

Optimization of CR risk SNPs for PRS prediction

The top GWAS SNPs were ranked based on P-value, and different percentages of top SNPs were selected to further evaluate model performance based on analysis using 10-fold cross-validation. The optimal predictive ability for CR risk varied across statistical models. The best predictive performance with all Bayesian models was obtained using 30% of the SNPs (42,621 SNPs). The best performance with RF was achieved using the top 1% of the SNPs (1,420 SNPs), while the best predictive performance for EN and LASSO was obtained using 2% of the SNPs (2,841 SNPs). The best performance with SVM was obtained using 7% of the SNPs (9,944 SNPs).
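The SNP counts quoted above follow from taking a percentage of the 142,071 SNPs that passed quality control. The sketch below ranks SNPs by GWAS P-value and keeps the top fraction; rounding down to a whole SNP is an assumption, chosen because it reproduces the counts reported in the text. Function names and the toy P-values are hypothetical.

```python
def n_top(total_snps, percent):
    """Number of SNPs in the top `percent` (integer), rounded down."""
    return total_snps * percent // 100


def top_snps(p_values, percent):
    """Return SNP IDs with the smallest GWAS P-values.

    p_values: dict mapping SNP ID to P-value.
    """
    ranked = sorted(p_values, key=p_values.get)
    return ranked[:n_top(len(p_values), percent)]


# Counts for the SNP set sizes reported per model family.
counts = {pct: n_top(142071, pct) for pct in (1, 2, 7, 30)}

# Toy ranking with four SNPs, keeping the top 50%.
selected = top_snps({"a": 0.5, "b": 1e-8, "c": 0.01, "d": 0.2}, 50)
```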

Model performance using top GWAS SNPs, threshold optimization with the test group, and subsequent 10-fold cross validation in the reference group

We used the P^*, ACC, and gMean metrics for model tuning to determine the optimum threshold points for each model using Labrador Retriever dogs in the TST group. We considered the threshold as optimal when the ACC and gMean were highest and P^* was minimum. Optimal thresholds ranged from 0.425 to 0.5 (Figure 2, Table 1). Among machine learning models, the highest ACC was achieved by the LASSO machine learning model after model tuning (0.844) and the highest AUC was achieved with the SVM model (0.838) at P^* = 0.021 (Table 1). For Bayesian models, the highest ACC was observed with Bayes C (0.836) at P^* = 0.1 (Table 1), which also achieved the highest AUC (0.83).

Figure 2

Figure 2. A grid search was performed to identify the optimum threshold range and evaluate model performance for prediction of cruciate ligament rupture case status. Each data point represents a threshold ranging from 0 to 1 with 0.025 intervals. Lower P* values aligned with higher ACC and gMean values. The circle dots show the optimum point for each model. EN, Elastic Net; LASSO, Least Absolute Shrinkage and Selection Operator; RF, Random Forest; SVM, Support Vector Machine; ML, machine learning; BL, Bayesian Lasso; BRR, Bayesian Ridge Regression. The analysis used a training group of 1,006 Labrador Retrievers for model training and a test group of 52 Labradors for prediction optimization.

Table 1

Table 1. Accuracy of Bayesian and machine learning statistical models for prediction of cruciate ligament rupture risk using polygenic risk scores in the Labrador Retriever reference population using 10-fold cross validation, top genome-wide risk SNPs, and tuned posterior probability thresholds.

Tuned model performance with and without covariates in the independent validation test set using GWAS top SNPs

Predictive performance with covariates yielded higher ACC for all algorithms after model tuning except RF; results are summarized in Table 2. With covariates and posterior probability threshold tuning, the LASSO and EN algorithms yielded the highest ACC (0.885). Amongst the Bayesian models, the BL and BayesC algorithms yielded the highest ACC (0.842). These models also yielded the highest AUC values. Without covariates, ACC values were lower, and the Bayesian ensemble approach yielded the highest ACC (0.769) and AUC (0.768). Without covariates, Bayesian models outperformed machine learning models.

Table 2

Table 2. Polygenic risk score prediction accuracy for cruciate ligament rupture in the Labrador Retriever validation group using Bayesian and machine learning models with and without covariates and top genome-wide risk SNPs.

Predicting cruciate ligament rupture risk using genic SNPs

The 41 CR risk genes previously identified in the literature were functionally verified with the GO terms analysis to confirm their relationship with CR. Each GO term represents a particular biological process in the body, and we counted the number of genes from our list that matched each GO term (Figure 3). Amongst the twenty GO terms, extracellular matrix organization (P = 4.89E-52), degradation of the extracellular matrix (P = 5.5E-32), collagen metabolic process (P = 2.3E-14), skeletal system development (P = 6.9E-14), elastic fiber formation (P = 1.78E-13), response to growth factor (P = 6.8E-11), ossification (P = 1.38E-10), regulation of the extracellular matrix organization (P = 2.75E-6), and Type 1 collagen synthesis (P = 1.23E-5) were particularly associated with CR.

Figure 3

Figure 3. Gene ontology (GO) term and wiki pathway analysis for association with cruciate ligament rupture (CR). The graph compares each GO term's P-value for association with CR to the other GO terms. The ID and name for each GO term is listed for each column.

PRS prediction was also performed with and without covariates using genic SNPs. With covariates, Bayesian model performance was improved for the BRR, BL, BayesB, and BayesC algorithms, and for the RF and SVM machine learning models, compared with the analysis with genome-wide SNPs (Tables 2, 3). The LASSO and the BL algorithms had the highest ACC (0.875) (Table 3). For the machine learning models with covariates, AUC was highest with the LASSO model (0.871) and for the Bayesian models with covariates the BL model had the highest AUC (0.874). Without covariates, only the LASSO, EN, and ensemble machine learning models exhibited enhanced performance (Tables 2, 3). The LASSO and the EN algorithms had the highest ACC (0.731) without covariates. For the machine learning models without covariates, AUC was highest with the ensemble model (0.731) and for the Bayesian models without covariates the BayesB model had the highest AUC (0.712) (Table 3).

Table 3

Table 3. Polygenic risk score prediction accuracy for cruciate ligament rupture in the Labrador Retriever validation group using Bayesian and machine learning models with and without covariates and genic SNPs.

Discussion

Canine CR is a common orthopaedic disease with a high economic burden from long-term morbidity due to the development of stifle osteoarthritis even in the face of surgical correction (4, 5, 44), so population screening to identify dogs with elevated risk would be an impactful development (5, 45). CR in the Labrador Retriever is a complex heritable disease made up of numerous small effect SNPs and relatively few large effect ones (5). Age of neutering is an important environmental effect (46).

PRS prediction is a powerful tool for defining the heritable risk of developing a disease in an individual subject and is well suited to quantifying a subject's risk for highly complex heritable diseases (19, 20). Such an approach has been extensively used to assess risk of human complex heritable diseases (44). Overall, prediction accuracy was similar between the statistical models we studied. Validation data from the current study suggest PRS prediction of risk of CR in the Labrador Retriever is sufficiently accurate for use as a clinical screening tool for personalized medical care and selection for breeding with a prediction accuracy up to 88% with inclusion of covariates and up to 77% with analysis of only genetic information. Given that PRS prediction only needs a DNA sample easily obtained from a saliva swab, such testing can be performed in puppies before sale to the public, which is potentially advantageous compared with phenotypic screening later in life when dogs may already have been used for breeding or undergone training as a working dog. Additionally, PRS risk prediction testing can provide owners with information that can guide personalized care of the individual dog, particularly regarding modifiable environmental risk factors, such as neutering before 1 year of age (46).

We have recruited a large reference population of Labrador Retrievers over several years that we used as a TRN group for PRS prediction modeling using 10-fold cross validation in the present study. During the initial cross-validation analysis, we found that the optimal SNP set varied amongst the prediction models studied, as previous research has suggested (5). Our analysis therefore also considered use of an optimal number of SNPs for each model that maximized prediction accuracy. Optimal SNP set size was variable between models, with the Bayesian models having the best ability to handle larger numbers of SNPs in the model training set.

We also found that tuning of the posterior probability threshold led to additional gains in ACC in classifying CR cases from controls, as opposed to using a single threshold of 0.5 for all models (20). In our analysis, we found that ACC and P^* did not always align exactly on a specific threshold probability. In this scenario, we emphasized ACC and gMean in our tuning optimization, as ACC is the most clinically relevant parameter describing predictive ability. With the inclusion of individualized optimal posterior probability thresholds, prediction models generally surpassed an ACC of 0.8 with 10-fold cross validation. With our analysis of the validation TST group, model predictions also surpassed an ACC of 0.8 when covariates were included in the model, except for the RF model, suggesting our CR genetic risk prediction approach is a clinically relevant genetic test. Machine learning models, such as RF, require tuning for optimal performance and the weaker performance of this model is likely due to problems with model tuning.

A drawback of 10-fold cross validation is its tendency to overfit the data, resulting in artificially high PRS values, because of relatedness between individuals in an inbred population, even if the population is a large one. With our initial 10-fold cross validation within the reference population, ACC was generally above 0.8, but when the validation population was tested without consideration of covariates, ACC fell below 0.7 for machine learning models and below 0.8 for Bayesian models, suggesting overfitting was present in the 10-fold cross validation analysis. This highlights the importance of accounting for population structure and relatedness in predictive modeling, as failure to do so may lead to inflated performance metrics and poor generalizability. Using an external, independent validation group of subjects can help mitigate overfitting by reducing data leakage and ensuring better model robustness. Moreover, consideration of covariates may enhance the validity of PRS prediction, particularly in genetically homogenous or related populations. Covariates are variables that can influence the phenotype independently of genetic risk. Identification and inclusion of covariates help separate the contributions of genetic effects from broader physiological or developmental factors and allow the models to more appropriately attribute variation in the results to genetic predictors (47).

Previous work on PRS prediction of CR risk in dogs has shown that the inclusion of covariates in PRS prediction increases accuracy (20). This observation was recapitulated in the present study. We found that the inclusion of sex, neuter status, age, weight, and withers height consistently produced higher predictive accuracy with both our GWAS SNP analysis and the genic only SNP analysis. Without covariates, the highest ACC was 0.769 using a Bayesian ensemble approach. With covariates, the highest ACC was 0.885 using the LASSO and EN machine learning models.

The covariates we considered are readily acquired during routine clinical assessment. This enhances the clinical utility of the PRS models described in this report, as it enables their integration into existing veterinary workflows without the need for additional or specialized clinical assessment or testing. Cost is a significant barrier to veterinary healthcare and obtaining the necessary covariates for our modeling can be done at minimal to no cost to clients. Use of our analytical approach promises early identification of disease risk and provision of timely information for owners by helping to assess a dog's suitability for breeding, or working, and for injury prevention.

We also considered PRS prediction using only genic SNPs. Given that CR is a highly polygenic disease in which risk SNPs are spread throughout the genome (5), we expected that limiting the SNPs to genic regions would reduce predictive AUC and ACC. We found that the LASSO, EN, and machine learning ensemble models that considered covariates had reduced ACC, but RF and SVM had higher ACC. With the BRR, BL, BayesB, and BayesC models, ACC was also improved by consideration of only genic SNPs, suggesting these models may better capture additive and non-linear effects in genic regions. This could be because exclusion of non-genic SNPs reduces statistical noise and thereby enhances the signal-to-noise ratio. Genic regions are more likely to contain variants with direct biological relevance, making it easier for models with shrinkage or feature selection, such as Bayesian or tree-based methods, to detect meaningful associations. The finding that performance was reduced with some models when only genic SNPs were considered further supports the notion that the genetic architecture of CR involves both coding and non-coding regulatory elements (48). Collectively, our findings suggest both genic and non-genic variants play important, complementary roles in PRS prediction and both need to be considered when conducting PRS analysis (5).

There are several limitations to this research. Our validation population was relatively small compared to the reference population used to train our PRS prediction models. Further expansion of both the TRN and TST groups of dogs would likely further elevate the power of our analysis and provide more robust results. The slight mismatch in the nadir of P^* with the peaks in ACC and gMean in our analysis may be indicative of the small sample size used for the validation population. Our analysis only considered the Labrador Retriever. Our gene ontology analysis was based on a candidate gene list that was recently published (17). Other approaches to generation of a gene list could have been used, such as genes associated with flanking regions around significant GWAS SNPs. Whilst coat color is known to be associated with CR risk in dogs (18), the coat color phenotype was not available for all dogs in the reference population. Also, GWAS risk SNPs should capture coat color genetic effects. Consequently, coat color was not included as a covariate in our bioinformatics approach to avoid artificially amplifying the risk associated with SNP markers in linkage disequilibrium (LD) with both coat color and CR risk.

Previous work from our laboratory suggests there is heterogeneity in the genetic contribution to CR in different breeds of dog (5). Further investigation into this aspect of the genetic contribution to CR is needed in other high-risk breeds such as the Rottweiler and Newfoundland. Ultimately, development of a bioinformatics PRS prediction approach that overcomes this problem would substantially enhance the clinical impact of genetic risk testing for CR in dogs.

In conclusion, our findings suggest that PRS prediction of risk of CR in the Labrador Retriever has sufficient predictive utility for clinical application using only genetic markers, with an ACC of 77% with genome-wide SNPs and a Bayesian ensemble approach. We identified further gains in ACC with inclusion of additional readily obtainable clinical covariates, yielding an ACC of 88.5% with genome-wide SNPs and a machine learning approach using the LASSO or EN algorithms. Clinically, genetic risk prediction testing has great utility and can be used by breeders during selection for breeding without the need for radiographic testing or waiting years to make an epidemiological determination of the CR status of the dog (23). Additionally, genetic risk testing for CR can be used for screening of individual dogs, particularly working dogs that undertake athletic activity where development of CR would impair performance. Improved personalized care of the individual patient should focus on correcting modifiable environmental factors in dogs with high genetic risk (46).

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://figshare.com/, doi: 10.6084/m9.figshare.28641896 and https://datadryad.org/stash, doi: 10.5061/dryad.47d7wm3ns.

Ethics statement

The animal studies were approved by the University of Wisconsin-Madison School of Veterinary Medicine Institutional Animal Care and Use Committee. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent was obtained from the owners for the participation of their animals in this study.

Author contributions

BM: Writing – original draft, Methodology, Data curation, Investigation, Writing – review & editing. MM: Validation, Writing – review & editing, Visualization, Formal analysis, Writing – original draft, Supervision, Methodology, Data curation, Investigation, Conceptualization. SS: Resources, Writing – review & editing, Funding acquisition, Supervision, Data curation. PM: Methodology, Data curation, Project administration, Validation, Conceptualization, Writing – review & editing, Funding acquisition, Supervision, Resources, Investigation.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. A grant from the University of Wisconsin-Madison, School of Veterinary Medicine Companion Animal Fund was used to support this work. The Melita Grunow Family Professorship awarded to Dr. Muir also provided support.

Acknowledgments

The authors would like to thank the faculty, residents, and students throughout the University of Wisconsin-Madison UW Veterinary Care Hospital and the many individual breeders and pet owners for their help in recruitment of Labrador Retrievers for this study and the community gift funding provided for this work. Further, the authors also gratefully acknowledge the support of Dr. Rory Todhunter of Cornell University who provided some of the Labrador Retriever SNP data that form part of the reference population.

Conflict of interest

The authors of this manuscript have the following competing interests: PM and MM are named on US Patent US20160222451A1 “Method to predict heritable canine non-contact cruciate ligament rupture.” This does not alter our adherence to journal policies on data sharing and materials.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Gen AI was used in the creation of this manuscript.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Engdahl K, Hanson J, Bergström A, Bonnett B, Höglund O, Emanuelson U. The epidemiology of stifle joint disease in an insured Swedish dog population. Vet Rec. (2021) 189:197. doi: 10.1002/vetr.197

2. Hayashi K, Kim SY, Lansdowne JL, Kapatkin A, Déjardin LM. Evaluation of a collagenase generated osteoarthritis biomarker in naturally occurring canine cruciate disease. Vet Surg. (2009) 38:117–21. doi: 10.1111/j.1532-950X.2008.00446.x

3. Buote N, Fusco J, Radasch R. Age, tibial plateau angle, sex, and weight as risk factors for contralateral rupture of the cranial cruciate ligament in Labradors. Vet Surg. (2009) 38:481–9. doi: 10.1111/j.1532-950X.2009.00532.x

4. Wilke VL, Robinson DA, Evans RB, Rothschild MF, Conzemius MG. Estimate of the annual economic impact of treatment of cranial cruciate ligament injury in dogs in the United States. J Am Vet Med Assoc. (2005) 227:1604–7. doi: 10.2460/javma.2005.227.1604

5. Momen M, Kearney HK, Patterson MM, Sample SJ, Zhao Z, Lu Q, et al. Cross-species analysis of genetic architecture and polygenic risk scores for non-contact ACL rupture in dogs and humans. Commun Biol. (2025) 8:26. doi: 10.1038/s42003-024-07395-9

6. Witsberger TH, Villamil JA, Schultz LG, Hahn AW, Cook JL. Prevalence of and risk factors for hip dysplasia and cranial cruciate ligament deficiency in dogs. J Am Vet Med Assoc. (2008) 232:1818–24. doi: 10.2460/javma.232.12.1818

7. Bleedorn JA, Greuel EN, Manley PA, Schaefer SL, Markel MD, Holzman G, et al. Synovitis in dogs with stable stifle joints and incipient cranial cruciate ligament rupture: a cross-sectional study. Vet Surg. (2011) 40:531–43. doi: 10.1111/j.1532-950X.2011.00841.x

8. Comerford EJ, Smith K, Hayashi K. Update on the aetiopathogenesis of canine cranial cruciate ligament disease. Vet Comp Orthop Traumatol. (2011) 24:91–8. doi: 10.3415/VCOT-10-04-0055

9. Beaulieu ML, Ashton-Miller JA, Wojtys EM. Loading mechanisms of the anterior cruciate ligament. Sports Biomech. (2023) 22:1–29. doi: 10.1080/14763141.2021.1916578

10. Magnusson K, Turkiewicz A, Hughes V, Frobell R, Englund M. High genetic contribution to anterior cruciate ligament rupture: Heritability ~69%. Br J Sports Med. (2021) 55:385–9. doi: 10.1136/bjsports-2020-102392

11. Baker LA, Kirkpatrick B, Rosa GJM, Gianola D, Valente B, Sumner JP, et al. Genome-wide association analysis in dogs implicates 99 loci as risk variants for anterior cruciate ligament rupture. PLoS One. (2017) 12:e0173810. doi: 10.1371/journal.pone.0173810

12. Cook SR, Conzemius MG, McCue ME, Ekenstedt KJ. SNP-based heritability and genetic architecture of cranial cruciate ligament rupture in Labrador Retrievers. Anim Genet. (2020) 51:824–8. doi: 10.1111/age.12978

13. Nielen AL, Knol BW, van Hagen MA, van der Gaag I. Genetic and epidemiological investigation of a birth cohort of boxers. Tijdschr Diergeneeskd. (2003) 128:586–90.

14. Wilke VL, Conzemius MG, Kinghorn BP, Macrossan PE, Cai W, Rothschild MF. Inheritance of rupture of the cranial cruciate ligament in Newfoundlands. J Am Vet Med Assoc. (2006) 228:61–4. doi: 10.2460/javma.228.1.61

15. Binversie EE, Walczak BE, Cone SG, Baker LA, Scerpella TA, Muir P. Canine ACL rupture: a spontaneous large animal model of human ACL rupture. BMC Musculoskelet Disord. (2022) 23:116. doi: 10.1186/s12891-021-04986-z

16. Karlsson EK, Lindblad-Toh K. Leader of the pack: gene mapping in dogs and other model organisms. Nat Rev Genet. (2008) 9:713–25. doi: 10.1038/nrg2382

17. Baker LA, Momen M, McNally R, Berres ME, Binversie EE, Sample SJ, et al. Biologically enhanced genome-wide association study provides further evidence for candidate loci and discovers novel loci that influence risk of anterior cruciate ligament rupture in a dog model. Front Genet. (2021) 12:593515. doi: 10.3389/fgene.2021.593515

18. Lee B, Baker L, Momen M, Terhaar H, Binversie EE, Sample SJ, et al. Identification of genetic variants associated with anterior cruciate ligament rupture and AKC standard coat color in the Labrador Retriever. BMC Genom Data. (2023) 24:60. doi: 10.1186/s12863-023-01164-z

19. Meuwissen T, Hayes B, Goddard M. Genomic selection: a paradigm shift in animal breeding. Anim Front. (2016) 6:6–14. doi: 10.2527/af.2016-0002

20. Baker LA, Momen M, Chan K, Bollig N, Lopes FB, Rosa GJM, et al. Bayesian and machine learning models for genomic prediction of anterior cruciate ligament rupture in the canine model. G3 (Bethesda). (2020) 10:2619–28. doi: 10.1534/g3.120.401244

21. Momen M, Muir P. Polygenic risk score prediction of complex diseases in companion animals: prospects, opportunities, and challenges. Am J Vet Res. (2025) 86:ajvr.25.01.0018. doi: 10.2460/ajvr.25.01.0018

22. Chuang C, Ramaker MA, Kaur S, Csomos RA, Kroner KT, Bleedorn JA, et al. Radiographic risk factors for contralateral rupture in dogs with unilateral cranial cruciate ligament rupture. PLoS One. (2014) 9:e106389. doi: 10.1371/journal.pone.0106389

23. Reif U, Probst CW. Comparison of tibial plateau angles in normal and cranial cruciate deficient stifles of Labrador Retrievers. Vet Surg. (2003) 32:385–9. doi: 10.1053/jvet.2003.50047

24. Chang CC, Chow CC, Tellier LC, Vattikuti S, Purcell SM, Lee JJ. Second-generation PLINK: rising to the challenge of larger and richer datasets. GigaScience. (2015) 4:7. doi: 10.1186/s13742-015-0047-8

25. Browning BL, Zhou Y, Browning SR. A one-penny imputed genome from next-generation reference panels. Am J Hum Genet. (2018) 103:338–48. doi: 10.1016/j.ajhg.2018.07.015

26. Browning BL, Tian X, Zhou Y, Browning SR. Fast two-stage phasing of large-scale sequence data. Am J Hum Genet. (2021) 108:1880–90. doi: 10.1016/j.ajhg.2021.08.005

27. Pérez P, de los Campos G. Genome-wide regression and prediction with the BGLR statistical package. Genetics. (2014) 198:483–95. doi: 10.1534/genetics.114.164442

28. Hastie T, Tibshirani R, Friedman J. Elements of Statistical Learning: Data Mining, Inference, and Prediction. Second Edition. New York, NY: Springer (2009).

29. Friedman J, Hastie T, Tibshirani R. Regularization paths for generalized linear models via coordinate descent. J Stat Softw. (2010) 33:1–22. doi: 10.18637/jss.v033.i01

30. Zhao H, Williams GJ, Huang JZ. Wsrf: An R package for classification with scalable weighted subspace random forests. J Stat Softw. (2017) 77:1–30. doi: 10.18637/jss.v077.i03

31. Meyer D, Dimitriadou E, Hornik K, Weingessel A, Leisch F. Misc functions of the department of statistics, probability theory group (Formerly: E1071), TU Wien. In: CRAN: Contributed Packages. (1999). p. e1071. doi: 10.32614/CRAN.package.e1071

32. Perdry H, Dandine-Roulland C. gaston: Genetic Data Handling (QC, GRM, LD, PCA) & Linear Mixed Models. In: CRAN: Contributed Packages. (2015). doi: 10.32614/CRAN.package.gaston

33. Youden WJ. Index for rating diagnostic tests. Cancer. (1950) 3:32–5. doi: 10.1002/1097-0142(1950)3:1<32::AID-CNCR2820030106>3.0.CO;2-3

34. Kubat M, Holte RC, Matwin S. Machine learning for the detection of oil spills in satellite radar images. Mach Learn. (1998) 30:195–215. doi: 10.1023/A:1007452223027

35. Fawcett T. An introduction to ROC analysis. Pattern Recognit Lett. (2006) 27:861–74. doi: 10.1016/j.patrec.2005.10.010

36. Hand DJ, Till RJ. A simple generalisation of the area under the ROC curve for multiple class classification problems. Mach Learn. (2001) 45:171–86. doi: 10.1023/A:1010920819831

37. McClish DK. Analyzing a portion of the ROC curve. Med Decis Mak. (1989) 9:190–5. doi: 10.1177/0272989X8900900307

38. Robin X, Turck N, Hainard A, Tiberti N, Lisacek F, Sanchez J-C, et al. pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinformatics. (2011) 12:77. doi: 10.1186/1471-2105-12-77

39. Magee L. R² measures based on Wald and likelihood ratio joint significance tests. Am Stat. (1990) 44:250–3. doi: 10.2307/2685352

40. Rees H, Maddala GS. Limited-dependent and qualitative variables in econometrics. Econ J. (1985) 95:493–4. doi: 10.2307/2233228

41. Cox DR, Snell EJ. Analysis of Binary Data. Second edition. London; New York: Chapman and Hall (1989).

42. Efron B. Regression and ANOVA with zero-one data: measures of residual variation. J Am Stat Assoc. (1978) 73:113–21. doi: 10.1080/01621459.1978.10480013

43. Nagelkerke NJD. A note on a general definition of the coefficient of determination. Biometrika. (1991) 78:691–2. doi: 10.2307/2337038

44. Rayward RM, Thomson DG, Davies JV, Innes JF, Whitelock RG. Progression of osteoarthritis following TPLO surgery: a prospective radiographic study of 40 dogs. J Small Anim Pract. (2004) 45:92–7. doi: 10.1111/J.1748-5827.2004.TB00209.X

45. Kidenya BR, Mboowa G. Unlocking the future of complex human diseases prediction: multi-omics risk score breakthrough. Front Bioinform. (2024) 4:1510352. doi: 10.3389/fbinf.2024.1510352

46. DeForge TL, Momen M, Conidi G, Muir P, Sample SJ. Age of neutering contributes to risk of cruciate ligament rupture in Labrador Retrievers. J Am Vet Med Assoc. (2024) 263:318–22. doi: 10.2460/javma.24.06.0406

47. Chatterjee N, Shi J, García-Closas M. Developing and evaluating polygenic risk prediction models for stratified disease prevention. Nat Rev Genet. (2016) 17:392–406. doi: 10.1038/nrg.2016.27

48. Bakker OB, Claringbould A, Westra H-J, Wiersma H, Boulogne F, Võsa U, et al. Identification of rare disease genes as drivers of common diseases through tissue-specific gene regulatory networks. Sci Rep. (2024) 14:30206. doi: 10.1038/s41598-024-80670-1

Keywords: cruciate ligament rupture, dog, genome-wide association study, genomic prediction, polygenic risk score prediction, Labrador Retriever

Citation: Miranda B, Momen M, Sample SJ and Muir P (2025) Accuracy of genome-enabled polygenic risk score prediction of cruciate ligament rupture risk in the Labrador Retriever. Front. Vet. Sci. 12:1625953. doi: 10.3389/fvets.2025.1625953

Received: 12 May 2025; Accepted: 17 July 2025;
Published: 26 August 2025.

Edited by:

Rody Artigas, Universidad de la República, Uruguay

Reviewed by:

Yi Pan, University of Missouri, United States
Eugenio Jara, Eugenio Jara, Uruguay

Copyright © 2025 Miranda, Momen, Sample and Muir. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Peter Muir, peter.muir@wisc.edu; Mehdi Momen, mmomen@wisc.edu
