- 1Department of Biomedical Engineering, Graduate School, Pusan National University, Busan, Republic of Korea
- 2Department of Biomedical Engineering, School of Medicine, Pusan National University, Busan, Republic of Korea
- 3In Silico Medicine Lab, Biomedical Research Institute, Pusan National University Hospital, Busan, Republic of Korea
Introduction: This study conducted verification, validation, and uncertainty quantification of finite element analysis (FEA) results for pedicle screw assemblies subjected to static flexion and extension loading conditions, in accordance with the ASTM F1717-15 standard.
Methods: Four screw configurations and two material types, titanium alloy and titanium grade 23, were modeled to replicate the experimental setup. Simulations incorporated five types of contact conditions at the screw–block interface: frictionless; coefficient of friction (COF) of 0.1, 0.2, or 0.5; and bonded.
Results: Experimental results demonstrated high repeatability, with maximum deviations of 6.4% for stiffness, 9.1% for yield displacement, 7.2% for yield force, and 7.5% for force at 20 mm displacement. The FEA results qualitatively captured the experimental trends but quantitatively overestimated mechanical responses, particularly under bonded contact conditions. The largest prediction errors were 19.8% for stiffness, 21.5% for yield force, and 18.4% for force at 20 mm, while the greatest deviation in yield displacement, 14.2%, occurred under frictionless conditions. When targeting an ASME VandV 40 Level 3 agreement (difference <10%), we found that no single interface model satisfied all validation metrics across screws and loading modes. When construct stiffness and force at 20 mm were prioritized as primary validation metrics, COF values in the range 0.10–0.20 yielded the most consistent agreement with experiment. Sensitivity analysis revealed that force and stiffness outputs were highly influenced by contact assumptions, whereas displacement outputs exhibited moderate sensitivity.
Discussion: These findings highlight the importance of accurately defining contact conditions and experimentally characterizing interface behavior. To ensure predictive accuracy and regulatory relevance, validated friction models should be applied and contact parameters precisely calibrated, which can enhance the credibility of spinal biomechanics simulations. Relative to prior F1717-based simulations, this work quantifies the dominant impact of interface modeling and provides actionable parameter bounds for validation (μ = 0.10–0.20 for the tested constructs).
1 Introduction
Pedicle screws are widely used in spinal fusion, fracture stabilization, tumor resection, and corrective spinal procedures, serving as critical anchors to stabilize and realign the spine (Park et al., 2024; Jiang et al., 2025). Their mechanical performance is therefore essential for successful clinical outcomes.
In Korea, the Ministry of Food and Drug Safety (MFDS) requires mechanical performance testing for regulatory approval, typically following ASTM F1717 flexion, extension, rotation, and fatigue protocols (Baumann et al., 2021; Biswas et al., 2019; Rana et al., 2020; ASTM F1717 Standard, 2015, Ministry of Food and Drug Safety, 2025). The standard mandates five repetitions per mode, totaling 20 tests, each costing approximately USD 1,000 in materials alone, which represents a substantial burden for small manufacturers, particularly when retesting is required after a failure.
Computational Modeling and Simulation (CM&S) offers a promising route to reduce the time and cost of this process (Morrison et al., 2018). However, replacing physical tests with simulations requires demonstrated model credibility through rigorous Verification and Validation (VandV) and Uncertainty Quantification (UQ) (US Food and Drug Administration, 2016; US Food and Drug Administration, 2023). Relevant frameworks include ASME VandV 10 for computational solid mechanics and ASME VandV 40 for risk-informed assessment of medical device models (ASME V&V 10-2006, 2016; ASME V&V 40-2018, 2018).
Based on these standards, this study focuses on the uncertainty quantification necessary for applying computational models as a substitute for real mechanical testing. Following the ASME VandV 40–2018 guidelines, the study aims to identify and evaluate sources of uncertainty to ensure the credible use of computational modeling and simulation for pedicle screw performance evaluation.
2 Methods
2.1 Question of interest (?OI) and context of use (COU)
Question of Interest (?OI) as the specific question, decision, or concern that the computational model aims to address within its designated Context of Use (COU). The ?OI serves as the foundation for assessing model risk and establishing the credibility goals required for the model.
The COU defines the specific role and scope of the computational model used to address the ?OI. It should include a detailed statement of what will be modeled and how the outputs from the computational model will be used to answer or inform the ?OI.
The ?OI in this study is the feasibility of using computational modeling and simulation as a reliable alternative to ASTM F1717 experimental testing for pedicle screw regulatory approval. The context of use (COU) involves pedicle screws, which are primarily used in spinal fusion, fracture stabilization, tumor resection, and corrective surgeries. These screws function as anchors within the vertebrae to stabilize the spine. Given their critical role, the safety and performance of pedicle screws are essential considerations. Accordingly, for regulatory approval, their mechanical behavior is evaluated by assembling them into a construct and testing them according to ASTM F1717 guidelines.
2.2 Model risk
Model risk is the possibility that the use of the computational model leads to a decision that results in patient harm and/or other undesirable impacts (Figure 1).
2.2.1 Model influence
Model influence is the contribution of the computational model relative to other contributing evidence in making a decision. In this study, we set the model influence to medium because the simulation outputs from the computational model are a moderate factor in the decision (Smith et al., 2012; Dickman et al., 1992).
2.2.2 Decision consequence
Decision consequence is the significance of an adverse outcome resulting from an incorrect decision. Consequences are typically considered in the context of potential harm to the patient (Stone, 2017; Bujold et al., 2022). In this study, we set the decision consequence to medium because an incorrect decision could result in minor patient injury or the need for physician intervention, or have other moderate impacts.
2.3 Model credibility
Model credibility refers to the level of trust in a computational model’s ability to accurately predict outcomes within its specified Context of Use (COU). It essentially determines whether the model is reliable enough to support decision-making processes, particularly in critical applications such as medical device development, where patient safety or performance outcomes are at stake (Figure 1) (Aldieri et al., 2023; Morrison et al., 2019).
Model credibility is typically established through verification and validation (VandV) activities. Verification ensures that the model’s implementation is correct, essentially confirming that the model was built correctly (Jeong et al., 2023). Validation determines whether the model accurately represents the real-world system it aims to simulate, ensuring that the right model was built (Courcelles et al., 2024).
2.4 Verification
2.4.1 Code verification
Computational analyses were conducted using the commercially available finite element analysis (FEA) software platform, ANSYS Mechanical. The code verification activities were performed at levels that met the required model risk threshold, as the commercial FEA software employed established and documented software quality assurance procedures. Additionally, the release notes and anomaly lists associated with the software version ANSYS 2024 R2 were reviewed. No issues were identified in the software code that would affect the validity of this study.
2.4.2 Calculation verification
Discretization error analysis was performed for the ASTM Standard F1717 pedicle screw assembly model. The pedicle screw assembly model consists of Ultra-High Molecular Weight Polyethylene (UHMWPE) blocks, pedicle screws, and spinal rods. Mesh refinement on the blocks, rods, and screws was performed to determine its impact on the maximum von Mises stress and reaction force in the assembly model. Construct stiffness was calculated as the slope of the elastic region, ranging from 2 mm to 10 mm displacement. Yield force was calculated using the 2% offset method, as specified in the ASTM Standard F1717. Additionally, the force at 20 mm displacement provided a post-yield quantity of interest (QOI).
2.5 Validation
2.5.1 Computational model
The computational model of the pedicle screw was reverse-engineered by acquiring a commercial pedicle screw and performing a 3D scan (Park et al., 2025). The UHMWPE block was modeled in 3D according to specifications, while the spinal rod was designed as a 3D model with a diameter of 6 mm and a length of 110 mm. And it was assembled in accordance with ASTM Standard F1717. The Finite Element (FE) model was constructed using the tetrahedral element C3D10 (Sim et al., 2022). This element type was selected to accurately represent the threads of the pedicle screw (Figure 2) (Kim et al., 2022).
2.5.1.1 Model form
As shown in Figure 2, the pin and jig were not explicitly modeled. Instead, a remote point was assigned, and boundary conditions were directly applied to the surface of the block where the pin had been inserted. With the lower pin fixed, an axial displacement was applied to the upper pin to simulate static compression and tension tests. Each block was able to rotate around the pin axis.
2.5.1.2 Model inputs
The UHMWPE blocks were modeled using a response function based on uniaxial tension test data from the ASTM Standard D638-14 test (ASTM D638-14 Standard, 2015). Spinal rods were simulated using a bilinear elastic-plastic material model based on the ASTM Standard E8 tensile test (ASTM E8 Standard, 2022). Pedicle screws were modeled with the same properties as spinal rods, as they must be manufactured from Ti-6Al-4V ELI (Titanium Grade 23) in accordance with approved material specifications (Table 1).
To further examine differences in material properties, we additionally incorporated the titanium alloy material properties provided by the ANSYS engineering database into the simulation. The simulation utilized implicit analysis, including the effects of large deformation.
To evaluate the influence of various model form assumptions on the mechanical performance of the construct, additional simulations were conducted. These simulations focused on key assumptions, including the boundary conditions at the screw-block interfaces and the material properties of the pedicle screw (Çetin and Bircan, 2021; Xu et al., 2019; Chen et al., 2003). At the screw-block interface, frictional contact with coefficients of 0.1, 0.2, and 0.5 was prescribed, informed by tribology reports for Ti–UHMWPE pairs (Leksycki et al., 2022) and recent F1717 modeling practice (Nagaraja et al., 2024). Bonded and frictionless contact were additionally analyzed as bounding cases to evaluate contact-assumption sensitivity.
2.5.1.2.1 Quantification of sensitivities
A sensitivity analysis was performed to investigate the influence of variations in the coefficient of friction and Young’s modulus. Five different contact conditions were applied to two material types, namely, titanium alloy and titanium grade 23.
2.5.1.2.2 Quantification of uncertainties
This component aims to investigate how uncertainties in the defined input parameters, specifically the coefficient of friction and Young’s modulus, are propagated through the simulation and influence the resulting outcomes.
2.5.2 Comparator
The ASTM Standard F1717 static compression and tension tests were conducted using pedicle screws approved for surgical use in hospitals.
2.5.2.1 Test samples
2.5.2.1.1 Quantity of test samples
Each test was performed five times in accordance with ASTM Standard F1717 recommendations.
2.5.2.1.2 Range of characteristics of test samples
The entire construct was assembled in accordance with ASTM Standard F1717 specifications. The spinal components were not intentionally fabricated at the extreme limits of specification tolerances; rather, they were assumed to fall within nominal dimensional ranges appropriate for standard testing protocols.
The spinal fusion construct consisted of a 6 mm diameter, 110 mm long medical-grade spinal rod (Ti-6Al-4V ELI), a UHMWPE block manufactured in compliance with ASTM Standard F1717, and mono-axial pedicle screws (Ti-6Al-4V ELI).
Four pedicle screw types (diameter × length) were evaluated: 6 × 40 mm, 6 × 50 mm, 7 × 40 mm, and 7 × 50 mm (Table 2).
2.5.2.1.3 Measurements of test samples
Prior to testing, dimensional measurements were performed to ensure construct consistency (Kim et al., 2023). Specifically, the distance from the center of the upper pin axis to the center of the lower pin axis, as well as the distance between the upper and lower screws, was measured using a vernier caliper. To enhance measurement reliability, at least two independent observers performed the assessments.
2.5.2.1.4 Uncertainty of test sample measurements
The single test method for static compression and tension was conducted using a calibrated universal testing machine (Shimadzu Corporation) at a speed of 25 mm/min, in accordance with ASTM Standard F1717-15, with a maximum displacement of 20 mm. Force–displacement data were recorded at 100 Hz (Figure 3).
2.5.2.2 Test conditions
2.5.2.2.1 Quantity of test conditions
In accordance with ASTM Standard F1717-15, the test was conducted under two test conditions (n = 2), compression and tension, at room temperature, using a loading speed of 25 mm/min and a maximum displacement of 20 mm.
2.5.2.2.2 Range of test conditions
Two tests were performed within a range of conditions near the nominal values.
2.5.2.2.3 Measurements of test conditions
This evaluation criterion assesses the extent to which test conditions were quantitatively and systematically measured. In this case, the test conditions, including temperature, loading speed, and displacement, were quantitatively defined and controlled.
2.5.2.2.4 Uncertainty of test condition measurements
Environmental conditions were monitored and controlled with appropriate instrumentation: temperature was measured using a thermo-hygrometer, and loading speed and displacement were controlled using a universal testing machine that had been calibrated and verified by a qualified engineer. Consequently, a comprehensive quantification of uncertainty, encompassing factors such as repeatability, was achieved.
2.6 Verification activities
2.6.1 Code verification
2.6.1.1 Software quality assurance (SQA)
SQA procedures were clearly defined and systematically documented. The software anomaly list and development environment were thoroughly assessed to ensure complete understanding, and their impact on the COU was analyzed and recorded. Additionally, quality metrics were continuously monitored and tracked to uphold rigorous standards throughout the development process.
2.6.1.2 Numerical code verification (NCV)
Benchmark verification problems were conducted to validate the proper functioning of software installations on their respective hardware systems. The obtained results demonstrated consistency with established benchmark problem solutions and exhibited convergence with mesh refinement, confirming the accuracy and reliability of the computational framework.
2.6.2 Calculation verification
Calculation verification was performed through analyses of discretization error, numerical solver error (NSE), and use error.
2.6.2.1 Discretization error
To verify discretization error, the reaction force, which is identified as a quantity of interest (QOI) in the ASTM Standard F1717-15 assembly, has been observed to converge with a variation of less than 5% through mesh refinement applied to the UHMWPE block, spinal rod, and pedicle screw (Figure 4). In the final convergence step, the percent difference in reaction force between Case 8 and Case 9 was 0.04%, which is well below the target accuracy threshold of 5%. Therefore, Case 8 was selected for all subsequent simulations.
Convergence was achieved with an average element edge length of 3 mm for the UHMWPE block, 1 mm for the spinal rod, and 1 mm for the pedicle screw. Additionally, an element edge length of 1 mm was assigned separately for the contact surface between the UHMWPE block and the pedicle screw (Table 3).
2.6.2.2 Numerical solver error (NSE)
Numerical solver error was quantified by varying the force and displacement convergence tolerances in the software. Simulations were conducted by increasing the convergence tolerances for force and displacement by factors of 2 and 4, respectively, relative to the default solver settings. The target numerical accuracy for the analysis was pre-specified as less than 5%, in line with commonly accepted standards for orthopedic finite element analyses.
2.6.2.3 Use error
In order to identify potential use errors associated with the simulations, an internal peer review was conducted on the simulations. Key model inputs were verified, and no input errors were detected in any simulation.
2.7 Validation activities
The experimental and simulation results are presented in Figures 5–8 and Tables 4–7. For each screw type (640, 650, 740, and 750), five flexion tests and five extension tests were performed. Values in brackets indicate the percent (%) difference relative to the average, computed from all measurements.
Figure 5. Pedicle screw 640 Graph: Flexion, (a) All Results; (b) Tests; (c) Titanium Alloy Simulations; (d) Titanium Grade 23 Simulations; (e) Stiffness; (f) Yield Displacement; (g) Yield Force; (h) Force at 20 mm; Extension, (i) All Results; (j) Tests; (k) Titanium Alloy Simulations; (l) Titanium Grade 23 Simulations; (m) Stiffness; (n) Yield Displacement; (o) Yield Force; (p) Force at 20 mm.
Figure 6. Pedicle screw 650 Graph: Flexion, (a) All Results; (b) Tests; (c) Titanium Alloy Simulations; (d) Titanium Grade 23 Simulations; (e) Stiffness; (f) Yield Displacement; (g) Yield Force; (h) Force at 20 mm; Extension, (i) All Results; (j) Tests; (k) Titanium Alloy Simulations; (l) Titanium Grade 23 Simulations; (m) Stiffness; (n) Yield Displacement; (o) Yield Force; (p) Force at 20 mm.
Figure 7. Pedicle screw 740 Graph: Flexion, (a) All Results; (b) Tests; (c) Titanium Alloy Simulations; (d) Titanium Grade 23 Simulations; (e) Stiffness; (f) Yield Displacement; (g) Yield Force; (h) Force at 20 mm; Extension, (i) All Results; (j) Tests; (k) Titanium Alloy Simulations; (l) Titanium Grade 23 Simulations; (m) Stiffness; (n) Yield Displacement; (o) Yield Force; (p) Force at 20 mm.
Figure 8. Pedicle screw 750 Graph: Flexion, (a) All Results; (b) Tests; (c) Titanium Alloy Simulations; (d) Titanium Grade 23 Simulations; (e) Stiffness; (f) Yield Displacement; (g) Yield Force; (h) Force at 20 mm; Extension, (i) All Results; (j) Tests; (k) Titanium Alloy Simulations; (l) Titanium Grade 23 Simulations; (m) Stiffness; (n) Yield Displacement; (o) Yield Force; (p) Force at 20 mm.
2.7.1 Model form analysis
Sensitivity analysis was conducted to investigate the contact and interaction conditions between the UHMWPE block and the pedicle screw. In this study, the thread geometry of the pedicle screw was accurately modeled in three dimensions using 3D scanning equipment to replicate the actual screw shape. Two different material types, titanium alloy and titanium grade 23, were used to define the mechanical properties of the pedicle screw.
Five contact conditions between the UHMWPE block and the pedicle screw were considered: frictionless, coefficient of friction (COF) values of 0.1, 0.2, 0.5, and bonded. Additionally, the interface between the pedicle screw and spinal rod was modeled as bonded.
The experimental and simulation results indicated that the frictionless contact condition between the UHMWPE block and the pedicle screw had the most significant impact, resulting in a maximum stiffness reduction of 11.7% compared to the bonded condition. In terms of yield displacement, the frictionless condition showed a maximum increase of 11.5%. Yield force exhibited a maximum reduction of 7.7%, and the force at 20 mm decreased by up to 10% under the frictionless condition relative to the bonded condition.
2.7.2 Sensitivity analysis
The sensitivity analysis results were obtained by applying five different contact conditions to two material types (titanium alloy and titanium grade 23). The variable having the greatest impact on stiffness was the contact condition, showing a maximum difference of 19.8% compared to the experimental mean value. The contact condition also had the most significant effect on yield displacement, with a maximum difference of 14.2%. Similarly, yield force showed a maximum variation of 21.5%, and the force at 20 mm exhibited a maximum difference of 18.4%, both due to variations in the contact condition.
3 Results
3.1 Static compression and tension data
3.1.1 Pedicle screw 640 compression and tension
Under the 640 flexion condition, the experimental averages were: stiffness 48.36 N/mm, yield displacement 18.30 mm, yield force 811.67 N, and force at 20 mm 864.08 N. As contact constraint increased from frictionless to frictional to bonded, the model predicted higher stiffness, yield force, and force at 20 mm, with a slight decrease in yield displacement. For the Ti alloy model, stiffness increased from 47.13 to 53.00 N/mm, yield force from 822.00 to 891.00 N, and the force at 20 mm from 848.58 to 933.66 N, while yield displacement decreased from 18.95 to 18.32 mm. For Ti Grade 23, stiffness rose from 48.60 to 54.40 N/mm, yield force from 853.00 to 907.20 N, and the force at 20 mm from 877.07 to 952.44 N, while yield displacement decreased from 19.06 to 18.19 mm.
Under 640 extension, experiment averaged stiffness 54.55 N/mm and force at 20 mm 1,033.14 N (one specimen: 20.00 mm yield displacement, 955.88 N yield force). In simulation, tightening the contact from frictionless → frictional → bonded increased stiffness and the force at 20 mm and slightly reduced yield displacement. Ti alloy: stiffness 55.21→62.55 N/mm, yield displacement 22.41→21.32 mm, yield force 1,154.67→1,240.00 N, force at 20 mm 1,070.00→1,189.40 N. Ti Grade 23: stiffness 57.05→64.35 N/mm, yield displacement 23.04→21.30 mm, yield force 1,229.33→1,274.67 N, force at 20 mm 1,116.60→1,223.00 N.
3.1.2 Pedicle screw 650 compression and tension
Under the 650 flexion condition, the experimental averages were: stiffness 45.83 N/mm, yield displacement 17.77 mm, yield force 744.60 N, and force at 20 mm 804.46 N. As contact constraint increased from frictionless to frictional to bonded, the model predicted higher stiffness, yield force, and force at 20 mm, with a slight decrease in yield displacement. For the Ti alloy model, stiffness increased from 47.73 to 53.28 N/mm, yield force from 836.20 to 891.60 N, and the force at 20 mm from 860.44 to 934.78 N, while yield displacement decreased from 19.03 to 18.25 mm. For Ti Grade 23, stiffness rose from 49.20 to 54.66 N/mm, yield force from 857.80 to 903.40 N, and the force at 20 mm from 883.98 to 950.36 N, while yield displacement decreased from 18.95 to 18.04 mm.
Under 650 extension, experiment averaged stiffness 54.70 N/mm and force at 20 mm 1,049.55 N. In simulation, tightening the contact from frictionless → frictional → bonded increased stiffness and the force at 20 mm and slightly reduced yield displacement. Ti alloy: stiffness 56.05→62.82 N/mm, yield displacement 22.53→21.15 mm, yield force 1,179.00→1,234.67 N, force at 20 mm 1,090.10→1,191.20 N. Ti Grade 23: stiffness 57.05→64.35 N/mm, yield displacement 22.68→20.93 mm, yield force 1,225.67→1,254.67 N, force at 20 mm 1,127.20→1,218.80 N.
3.1.3 Pedicle screw 740 compression and tension
Under the 740 flexion condition, the experimental averages were: stiffness 54.45 N/mm, yield displacement 17.26 mm, yield force 857.00 N, and force at 20 mm 932.62 N. As contact constraint increased from frictionless to frictional to bonded, the model predicted higher stiffness, yield force, and force at 20 mm, with a slight decrease in yield displacement. For the Ti alloy model, stiffness increased from 51.41 to 56.75 N/mm, yield force from 886.20 to 923.20 N, and the force at 20 mm from 916.79 to 975.16 N, while yield displacement decreased from 18.75 to 17.78 mm. For Ti Grade 23, stiffness rose from 52.52 to 57.79 N/mm, yield force from 888.80 to 925.20 N, and the force at 20 mm from 926.36 to 982.32 N, while yield displacement decreased from 18.44 to 17.52 mm.
Under 740 extension, experiment averaged stiffness 63.39 N/mm, yield displacement 19.75 mm, yield force 1,155.33 N, and force at 20 mm 1,163.14 N. In simulation, tightening the contact from frictionless → frictional → bonded increased stiffness and the force at 20 mm and slightly reduced yield displacement. Ti alloy: stiffness 60.42→67.17 N/mm, yield displacement 22.24→20.44 mm, yield force 1,252.00→1,271.00 N, force at 20 mm 1,170.00→1,254.20 N. Ti Grade 23: stiffness 61.75→68.53 N/mm, yield displacement 21.66→19.95 mm, yield force 1,243.33→1,263.50 N, force at 20 mm 1,183.10→1,265.10 N.
3.1.4 Pedicle screw 750 compression and tension
Under the 750 flexion condition, the experimental averages were: stiffness 52.03 N/mm, yield displacement 17.11 mm, yield force 811.20 N, and force at 20 mm 888.73 N. As contact constraint increased from frictionless to frictional to bonded, the model predicted higher stiffness, yield force, and force at 20 mm, with a slight decrease in yield displacement. For the Ti alloy model, stiffness increased from 53.39 to 58.40 N/mm, yield force from 890.80 to 927.40 N, and the force at 20 mm from 933.23 to 986.73 N, while yield displacement decreased from 18.20 to 17.40 mm. For Ti Grade 23, stiffness rose from 54.56 to 59.48 N/mm, yield force from 894.00 to 930.00 N, and the force at 20 mm from 942.09 to 993.24 N, while yield displacement decreased from 17.90 to 17.15 mm.
Under 750 extension, experiment averaged stiffness 61.73 N/mm, yield displacement 18.93 mm, yield force 1,074.70 N, and force at 20 mm 1,112.73 N. In simulation, tightening the contact from frictionless → frictional → bonded increased stiffness and the force at 20 mm and slightly reduced yield displacement. Ti alloy: stiffness 62.60→69.32 N/mm, yield displacement 21.34→19.75 mm, yield force 1,241.17→1,263.73 N, force at 20 mm 1,193.10→1,273.50 N. Ti Grade 23: stiffness 64.03→70.73 N/mm, yield displacement 20.77→19.26 mm, yield force 1,232.37→1,255.33 N, force at 20 mm 1,205.30→1,283.60 N.
3.2 Assessment
3.2.1 Equivalency of input parameters
The input parameters, including the material properties of the UHMWPE block and spinal rod, as well as temperature, loading speed, and the type and range of displacement control, were consistently maintained. However, in the case of the pedicle screw, although Titanium Grade 23 specified by the manufacturer was used, differences were observed between the outcomes of the comparator and the computational model.
3.2.2 Output comparison
3.2.2.1 Quantity
A total of four output variables were compared: stiffness, yield displacement, yield force, and force at 20 mm.
3.2.2.2 Equivalency of output parameters
Types of outputs were equivalent.
3.2.2.3 Rigor of output comparison
To ensure a rigorous evaluation, the comparison against the comparator incorporated the output uncertainty inherent in the computational model.
3.2.2.4 Agreement of output comparison
A qualitative agreement was confirmed between the quantities of interest (QOIs) produced by the computational model and those derived from the comparator.
3.3 Applicability of the validation activities to the COU
3.3.1 Relevance of the QOIs
The QOIs from the validation activities were identical to those for the COU.
3.3.2 Relevance of the validation activities to the COU
The COU encompassed some of the validation points.
4 Discussion
4.1 Key findings
This study evaluated the mechanical behavior of spinal constructs under static compression (flexion) and static tension (extension) conditions using four different pedicle screw configurations. Experimental testing demonstrated consistent results across all loading modes, with minimal variability among the five specimens tested in each group, confirming the reliability of the experimental setup. In this study, the rigor of output comparison and agreement of output comparison was required to satisfy Level 3, corresponding to comparison by measuring the difference between computational results and experimental data, with differences being less than 10%.
Under tension conditions, some of the test and computational model results for yield displacement and yield force exceeded the 20 mm displacement threshold. These values were subsequently estimated through simulation to predict the expected mechanical behavior. Comparative analysis between experimental and computational results revealed consistent trends across both material types, titanium alloy and titanium grade 23. For both static compression and tension scenarios, the finite element models generally predicted higher stiffness and force values than the experimental results, with the discrepancies most pronounced under bonded contact conditions. Titanium grade 23 consistently exhibited slightly higher stiffness, yield force, and force at 20 mm than titanium alloy under identical contact conditions, reflecting its superior material properties.
4.2 Relation to the state of the art
Recent F1717-based computational studies frequently employ simplified or single-valued interface assumptions, which can mask the dominant role of contact in construct response (Chen et al., 2003; Xu et al., 2019; Chu et al., 2019; La Barbera et al., 2014). Our results extend this line of work by quantifying how interface modeling governs agreement, in line with contemporary credibility frameworks (Nagaraja et al., 2024; ASME V&V 40-2018, 2018; US Food and Drug Administration, 2023): across screws and loading modes, no single interface model satisfied all validation metrics simultaneously (Xu et al., 2019; La Barbera et al., 2014), and when construct stiffness and force at 20 mm are prioritized, a coefficient of friction of 0.10–0.20 produced the most consistent agreement with experiment, consistent with prevailing practice and tribological evidence for Ti-6Al-4V/UHMWPE pairs (Eastlack et al., 2023; Chen et al., 2003; Leksycki et al., 2022).
4.3 Quantitative agreement relative to a predefined target
We explicitly targeted ASME VandV 40 Level 3 comparison (model–experiment difference <10%). By screw type and loading mode, multiple contact settings (frictionless; μ = 0.1, 0.2, or 0.5; and bonded) met this criterion for stiffness and force at 20 mm, whereas bonded interfaces frequently exceeded it most notably for yield force. This pattern indicates that over-constrained interfaces inflate apparent construct strength and stiffness and should not be used as defaults when validating spinal construct models.
4.4 Contact effects and material trends
The influence of contact assumptions on the mechanical behavior was clearly demonstrated. As the coefficient of friction decreased from bonded to frictionless conditions, both stiffness and force at 20 mm displacement consistently decreased, while yield displacement increased. For the titanium alloy model, stiffness ranged from 47.13 to 62.60 N/mm under frictionless conditions and from 53.00 to 69.32 N/mm under bonded conditions. Similarly, for the titanium grade 23 model, stiffness ranged from 48.60 to 64.03 N/mm (frictionless) to 54.40–70.73 N/mm (bonded). Yield displacement exhibited an inverse trend relative to stiffness. For the titanium alloy model, yield displacement ranged from 18.20 to 22.53 mm under frictionless conditions and from 17.40 to 21.32 mm under bonded conditions. For titanium grade 23, yield displacement ranged from 17.90 to 23.04 mm (frictionless) to 17.15–21.30 mm (bonded).
In terms of yield force, the titanium alloy model produced values between 822.00 and 1,252.00 N under frictionless conditions and between 891.00 and 1,271.00 N under bonded conditions. Similarly, titanium grade 23 showed yield force values ranging from 853.00 to 1,243.33 N (frictionless) and from 903.40 to 1,274.67 N (bonded).
The force at 20 mm displacement followed the same trend as stiffness and yield force. For titanium alloy, force at 20 mm ranged from 848.58 to 1,193.10 N (frictionless) and from 933.66 to 1,273.50 N (bonded). Titanium grade 23 exhibited force at 20 mm values ranging from 877.07 to 1,205.30 N (frictionless) and from 950.36 to 1,283.60 N (bonded).
4.5 Per-screw validation summary
For the 640 screw under flexion, simulation results within 10% deviation from experimental data were observed across all contact conditions, namely, Frictionless, COF 0.1, COF 0.2, COF 0.5, and Bonded, for the titanium alloy. In the case of titanium grade 23, the criterion was satisfied under Frictionless, COF 0.1, COF 0.2, and COF 0.5 contact conditions. Under extension, only one case each was available for yield displacement and yield force, so these outputs were excluded from the comparison. Among the remaining metrics, the titanium alloy satisfied the 10% criterion under Frictionless, COF 0.1, COF 0.2, and COF 0.5 contact conditions. The titanium grade 23 met the threshold under Frictionless, COF 0.1, and COF 0.2.
For the 650 screw under flexion, no condition satisfied the 10% deviation criterion across all output variables. Although stiffness, yield displacement, and force at 20 mm showed acceptable values, the yield force exceeded 10% under all contact conditions. For extension, since yield displacement and yield force were not available, the comparison was based on the remaining outputs. In this case, the titanium alloy satisfied the 10% criterion under Frictionless, COF 0.1, COF 0.2, and COF 0.5, while titanium grade 23 met the threshold under Frictionless, COF 0.1, and COF 0.2.
For the 740 screw in flexion, all contact models for both materials were within 10% of experiment. In extension, the titanium alloy met the 10% threshold only with bonded contact, while titanium grade 23 met it with frictionless, COF 0.1, COF 0.2, and bonded contact.
For the 750 screw under flexion, titanium alloy frictionless contact conditions satisfied the 10% criterion. Although stiffness, yield displacement, and force at 20 mm showed acceptable values excluding the bonded condition, yield force exceeded the 10% threshold under all other conditions. For the extension mode, stiffness and force at 20 mm met the 10% criterion under Frictionless, COF 0.1, and COF 0.2 for titanium alloy, and under Frictionless, COF 0.1, and COF 0.2 for titanium grade 23. However, for yield displacement and yield force, the titanium alloy under the Frictionless condition showed deviations of 12.7% and 15.5%, respectively, and 4.3% and 17.6% under the Bonded condition. Titanium grade 23 exceeded 9.7% and 14.7% under the Frictionless condition, and 1.7% and 16.8% under the Bonded condition, respectively.
Collectively, these per-screw results indicate that no single contact model satisfied all output metrics across all screw types and loading modes; when construct stiffness and force at 20 mm displacement are adopted as the primary validation metrics, a coefficient of friction of 0.10–0.20 is recommended.
4.6 Uncertainty and robustness
Because screw geometries were reverse-engineered from commercial implants, exact material parameters (e.g., E, σy, hardening) and screw–block contact properties (friction law and μ) represent epistemic uncertainties. Treating these as uncertain inputs clarifies why some cases align with Grade 23 properties while others are better matched by a lower effective modulus. We therefore recommend explicit reporting and justification of contact/material assumptions and, where feasible, experimental calibration of μ and material parameters to strengthen reproducibility.
4.7 Practical implication
For F1717 simulations intended for decision-making or regulatory submissions, we recommend avoiding fully bonded interfaces as a default, documenting the friction law and μ, and selecting values preferably μ = 0.10–0.20 that are justified by experiment for the specific construct. Overall, the computational models qualitatively captured experimental trends, but tended to overestimate force- and stiffness-related outputs under bonded assumptions, whereas displacement-related outputs showed moderate sensitivity. Overly simplified interfaces may thus lead to significant overestimations; future work should incorporate experimentally validated friction models and refined interface characterizations to enhance simulation fidelity.
5 Limitations
This study followed the ASTM F1717-15 standard configuration, which uses UHMWPE surrogate blocks and intentionally idealizes the bone implant environment. As such, the accuracy reported here should not be over generalized to vertebral anatomy, bone heterogeneity, or fusion interfaces. Our tests and simulations were limited to static compression (flexion) and static tension (extension); dynamic loading and fatigue behavior were not assessed, so durability under cyclic conditions remains outside the present scope.
The pedicle screw geometries were reconstructed from commercial implants, so the exact material properties (for example, Young’s modulus, yield strength, and hardening behavior) were not directly measured and should be regarded as epistemic uncertainties. In addition, the screw-block interface was not calibrated through independent tribological testing. Instead, we explored literature based friction coefficients (0.1–0.5) together with bounding assumptions (bonded and frictionless) to examine contact sensitivity, which leaves residual uncertainty in the interface model.
Future work will broaden validation to cadaveric or bone mimicking constructs, incorporate the F1717 fatigue protocol and other dynamic loading conditions, quantify the effects of set screw preload and torque, and expand the range of materials and surface conditions (for example, CoCr, coatings, roughness). We also plan to experimentally calibrate and identify key material and contact parameters to reduce uncertainty and improve generalizability.
6 Conclusion
This study validated finite element models of pedicle screw constructs under static flexion and extension in accordance with ASTM F1717-15, with verification and uncertainty quantification incorporated to support model credibility. Experimental repeatability was high (maximum deviations: stiffness 6.4%, yield displacement 9.1%, yield force 7.2%, force at 20 mm 7.5%).
Across configurations, the models reproduced the observed trends but tended to over-predict stiffness- and load-related outputs when fully bonded interfaces were assumed. The largest deviations reached 19.8% for stiffness, 21.5% for yield force, and 18.4% for force at 20 mm; yield-displacement errors peaked at 14.2% under frictionless assumptions. These patterns underscore the central role of interface definition: force and stiffness were highly sensitive to contact conditions, whereas displacement metrics showed moderate variability.
When construct stiffness and force at 20 mm are adopted as primary validation targets, a coefficient of friction in the range 0.10–0.20 provided the most consistent agreement with experiment across screws and loading modes. For decision-making or regulatory use, we recommend avoiding fully bonded interfaces as a default, explicitly reporting and justifying the chosen friction law and coefficient, and, where feasible, experimentally calibrating key material and contact parameters. These steps should improve predictive reliability and facilitate broader adoption of F1717-based simulations in practice.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.
Author contributions
OS: Writing – original draft, Software, Writing – review and editing, Data curation, Visualization, Methodology, Validation. BJ: Writing – review and editing, Data curation, Methodology, Formal analysis, Validation, Investigation. CL: Writing – review and editing, Conceptualization, Supervision, Project administration, Methodology, Resources, Investigation, Funding acquisition.
Funding
The authors declare that financial support was received for the research and/or publication of this article. This research was supported by a grant (RS-2023-00215601, RS-2023-00215638) from Ministry of Food and Drug Safety in 2025.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The authors declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Aldieri, A., Curreli, C., Szyszko, J. A., Mattina, A. A. L., and Viceconti, M. (2023). Credibility assessment of computational models according to ASME V&V40: application to the Bologna biomechanical computed tomography solution. Comput. Methods Programs Biomed. 240, 107727. doi:10.1016/j.cmpb.2023.107727
ASME V&V 10-2006 (2016). Guide for verification and validation in computational solid mechanics. New York, NY: The American Society of Mechanical Engineers: Three Park Avenue.
ASME V&V 40-2018 (2018). Assessing credibility of computational modeling through verification and validation: application to medical devices. New York, NY: The American Society of Mechanical Engineers: Two Park Avenue.
ASTM D638-14 Standard (2015). Test methods for tensile properties of plastics. West Conshohocken, PA, United States: ASTM International. doi:10.1520/D0638-14
ASTM E8 Standard (2022). Test methods for tension testing of metallic materials. West Conshohocken, PA, United States: ASTM International. doi:10.1520/E0008_E0008M-22
ASTM F1717 Standard (2015). Test methods for spinal implant constructs in a vertebrectomy model. West Conshohocken, PA, United States: ASTM International. doi:10.1520/F1717-15
Baumann, A. P., Graf, T., Peck, J. H., Dmitriev, A. E., Coughlan, D., and Lotz, J. C. (2021). Assessing the use of finite element analysis for mechanical performance evaluation of intervertebral body fusion devices. JOR SPINE 4 (1), e1137. doi:10.1002/jsp2.1137
Biswas, J. K., Sahu, T. P., Rana, M., Roy, S., Karmakar, S. K., Majumder, S., et al. (2019). Design factors of lumbar pedicle screws under bending load: a finite element analysis. Biocybernet. Biomed. Eng. 39 (1), 52–62. doi:10.1016/j.bbe.2018.10.003
Bujold, M., Pluye, P., Légaré, F., Hong, Q. N., Beaulieu, M. C., Bush, P. L., et al. (2022). Decision-making and related outcomes of patients with complex care needs in primary care settings: a systematic literature review with a case-based qualitative synthesis. BMC Prim. Care 23, 279. doi:10.1186/s12875-022-01879-5
Çetin, A., and Bircan, D. A. (2021). 3D pull-out finite element simulation of the pedicle screw-trabecular bone interface at strain rates. Proc. Institution Mech. Eng. Part H 236 (1), 134–144. doi:10.1177/09544119211044560
Chen, S.-I., Lin, R.-M., and Chang, C.-H. (2003). Biomechanical investigation of pedicle screw–vertebrae complex: a finite element approach using bonded and contact interface conditions. Med. Eng. and Phys. 25 (4), 275–282. doi:10.1016/S1350-4533(02)00219-9
Chu, Y.-L., Chen, C.-H., Tsuang, F.-Y., Chiang, C.-J., Wu, Y., and Kuo, Y.-J. (2019). Incomplete insertion of pedicle screws in a standard construct reduces the fatigue life: a biomechanical analysis. PLOS ONE 14 (11), e0224699. doi:10.1371/journal.pone.0224699
Courcelles, E., Horner, M., Afshari, P., Kulesza, A., Curreli, C., Vaghi, C., et al. (2024). Model credibility. In marco viceconti and luca emili, toward good simulation practice: best practices for the use of computational modelling and simulation in the regulatory process of biomedical products. Springer Nature Switzerland, 43–66. doi:10.1007/978-3-031-48284-7_4
Dickman, C. A., Fessler, R. G., MacMillan, M., and Haid, R. W. (1992). Transpedicular screw-rod fixation of the lumbar spine: operative technique and outcome in 104 cases. J. Neurosurg. 77 (6), 860–870. doi:10.3171/jns.1992.77.6.0860
Eastlack, R. K., Nunley, P. D., Poelstra, K. A., Vaccaro, A. R., Stone, M., Miller, L. E., et al. (2023). Finite element analysis comparing a PEEK posterior fixation device versus pedicle screws for lumbar fusion. J. Orthop. Surg. Res. 18, 855. doi:10.1186/s13018-023-04349-5
Jeong, B. C., Lee, C., Park, J., and Ryu, D. (2023). Identification of optimal surgical plan for treatment of extraocular muscle damage in thyroid eye disease patients based on computational biomechanics. Front. Bioeng. Biotechnol. 10, 969636. doi:10.3389/fbioe.2022.969636
Jiang, G., Wang, S., Xu, L., Li, Z., Feng, N., Qiu, Z., et al. (2025). Biomechanical effects of screw loosening after lumbar PEEK rod and titanium rod fixation: a finite element analysis. Front. Bioeng. Biotechnol. 13, 1533088. doi:10.3389/fbioe.2025.1533088
Kim, C. J., Son, S. M., Choi, S. H., Ryu, D., and Lee, C. (2022). Spinal stability analysis of lumbar interbody fusion according to pelvic type and cage angle based on simplified spinal model with various pelvic indices. Front. Bioeng. Biotechnol. 10, 1002276. doi:10.3389/fbioe.2022.1002276
Kim, T., Goh, T. S., Lee, J. S., Ryu, D., Yoon, S., and Lee, C. (2023). Experimental and numerical investigations for compressive behavior of porous hydroxyapatite bone scaffold fabricated via freeze-gel casting method. Mater. and Des. 231, 112080. doi:10.1016/j.matdes.2023.112080
La Barbera, L., Galbusera, F., Villa, T., Costa, F., and Wilke, H.-J. (2014). ASTM F1717 standard for the preclinical evaluation of posterior spinal fixators: can we improve it? Proc. Institution Mech. Eng. Part H 228 (10), 1014–1026. doi:10.1177/0954411914554244
Leksycki, K., Feldshtein, E., Maruda, R. W., Khanna, N., Krolczyk, G. M., and Pruncu, C. I. (2022). An insight into the effect surface morphology, processing, and lubricating conditions on tribological properties of Ti6Al4V and UHMWPE pairs. Tribol. Int. 170, 107504. doi:10.1016/j.triboint.2022.107504
Morrison, T. M., Pathmanathan, P., Adwan, M., and Margerrison, E. (2018). Advancing regulatory science with computational modeling for medical devices at the FDA’s office of science and engineering laboratories. Front. Med. 5, 241. doi:10.3389/fmed.2018.00241
Morrison, T. M., Hariharan, P., Funkhouser, C. M., Afshari, P., Goodin, M., and Horner, M. (2019). Assessing computational model credibility using a risk-based framework: application to hemolysis in centrifugal blood pumps. ASAIO J. 65 (4), 349–360. doi:10.1097/MAT.0000000000000996
Nagaraja, S., Loughran, G., Baumann, A. P., Kartikeya, K., and Horner, M. (2024). Establishing finite element model credibility of a pedicle screw system under compression-bending: an end-to-end example of the ASME V&V 40 standard. Methods 225, 74–88. doi:10.1016/j.ymeth.2024.03.003
Park, D., Lee, S. H., Lee, S., Park, J., Yang, H. G., Kim, C., et al. (2024). The efficacy of cervical pedicle screw is enhanced when used with 5.5-mm rods for metastatic cervical spinal tumor surgery. Neurospine 21 (1), 352–360. doi:10.14245/ns.2346778.389
Park, J. S., Goh, T. S., Lee, J. S., and Lee, C. (2025). Impact of asymmetric L4–L5 facet joint degeneration on lumbar spine biomechanics using a finite element approach. Sci. Rep. 15, 12613. doi:10.1038/s41598-025-97021-3
Rana, M., Roy, S., Biswas, P., Biswas, S. K., and Biswas, J. K. (2020). Design and development of a novel expanding flexible rod device (FRD) for stability in the lumbar spine: a finite-element study. Int. J. Artif. Organs 43 (12), 803–810. doi:10.1177/0391398820917390
Sim, O., Ryu, D., Lee, J., and Lee, C. (2022). Stress distribution on spinal cord according to type of laminectomy for large focal cervical ossification of posterior longitudinal ligament based on finite element method. Bioengineering 9 (10), 519. doi:10.3390/bioengineering9100519
Smith, J. S., Shaffrey, C. I., Ames, C. P., Demakakos, J., Fu, K. M. G., Keshavarzi, S. Y., et al. (2012). Assessment of symptomatic rod fracture after posterior instrumented fusion for adult spinal deformity. Neurosurgery 71 (4), 862–868. doi:10.1227/NEU.0b013e3182672aab
Stone, E. G. (2017). Unintended adverse consequences of a clinical decision support system: two cases. J. Am. Med. Inf. Assoc. 25 (5), 564–567. doi:10.1093/jamia/ocx096
US Food and Drug Administration (2016). Reporting of computational modeling studies in medical device submission: guidance for industry and food and drug administration staff.
US Food and Drug Administration (2023). Assessing the credibility of computational modeling and simulation in medical device submissions: guidance for industry and food and drug administration staff.
Keywords: ASTM standard F1717, finite element analysis (FEA), medical device digital tool (MDDT), ASME validation & verification (VandV) 40, pedicle screw, experiment, uncertainty quantification (UQ)
Citation: Sim O, Jeong BC and Lee C (2025) Verification, validation, and uncertainty quantification of finite element analysis results for pedicle screw assemblies under ASTM F1717 flexion and extension testing. Front. Bioeng. Biotechnol. 13:1623336. doi: 10.3389/fbioe.2025.1623336
Received: 05 May 2025; Accepted: 24 November 2025;
Published: 08 December 2025.
Edited by:
Xiaoyu Du, ETH Zürich, SwitzerlandCopyright © 2025 Sim, Jeong and Lee. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Chiseung Lee, dmljdG9yaWNoQHB1c2FuLmFjLmty
On Sim1