Development of a MediaPipe-based framework for biomechanical quantification of table tennis forehand strokes

Lyu, Yuanyuan; Duan, Xiaoling; Yang, Chen; Ye, Qiang

doi:10.3389/fspor.2025.1635581

ORIGINAL RESEARCH article

Front. Sports Act. Living, 15 August 2025

Sec. Biomechanics and Control of Human Movement

Volume 7 - 2025 | https://doi.org/10.3389/fspor.2025.1635581

This article is part of the Research TopicRevolutionizing sports science: Biomechanical models, wearable tech, and AIView all 5 articles

Development of a MediaPipe-based framework for biomechanical quantification of table tennis forehand strokes

Yuanyuan Lyu¹

Xiaoling Duan²

Chen Yang¹

Qiang Ye^3*

¹School of Sport and Health Science, Nanjing Sport Institute, Nanjing, China
²School of Table Tennis and Badminton, Nanjing Sport Institute, Nanjing, China
³Information Affairs Office, Nanjing Sport Institute, Nanjing, China

Introduction: This study aimed to quantify kinematic relationships across body segments during forehand strokes to provide interpretable metrics for single-camera based lightweight table tennis diagnostics.

Methods: We analyzed 34 female players (aged 9.1–21.7 years) from provincial teams, recording a total of 340 strokes (10 per player). An SVM model was used to predict ball speed, after which 320 strokes (8–10 per player) were retained by removing outliers in ball speed. From MediaPipe position time series, we calculated velocity, angle and angular velocity time series, and extracted kinematic parameters from these time series, including range, mean/peak/impact values. Within-subject correlation coefficients (r_ws) were calculated to identify key biomechanical parameters that contribute to the ball speed, while between-subject correlation coefficients (r_bs) were used to detect the relationship between age/height and ball speed.

Results: Ball speed increased with greater playing-side arm linear movement at the shoulder (r_ws = 0.51 to 0.63), elbow (r_ws = 0.63 to 0.70) and wrist (r_ws = 0.50 to 0.60), as well as with enhanced rotational motion at the playing-side upper arm (r_ws = 0.65 to 0.71), shoulder line (r_ws = 0.54 to 0.57), and hip line (r_ws = 0.51 to 0.59). Conversely, ball speed decreased with excessive contralateral shoulder horizontal flexion/extension (r_ws = −0.44 to −0.62) and playing-side elbow flexion-extension (r_ws = −0.35). At the population-level, ball speed increases with age before 14.3 years (r_bs = 0.68) but plateaus thereafter (r_bs = 0.17).

Discussion: This MediaPipe-based framework demonstrates potential for efficient biomechanical analysis in table tennis, providing a promising foundation for lightweight real-time analysis solutions.

1 Introduction

Table tennis requires precise whole-body coordination and accurate stroke timing, creating unique biomechanical analysis challenges. Biomechanical analysis reveals underlying technique patterns and improves player training and performance. Researchers employ various devices to study motion characteristics: optical systems (1–5), pressure and force sensors (1, 2), electromyography (EMG) (3), and inertial measurement units (IMUs) attached to the body or racket (6).

Optical devices serve as primary tools, used alone or combined with other sensors. They capture precise three-dimensional spatial data while allowing players to move freely without interference. Researchers examine movement patterns across skill levels, identifying subtle joint and muscle dynamics invisible to the naked eye. Wang et al. used optical systems and EMG to compare elite and amateur players. Elite players showed greater ankle eversion and larger knee and hip flexion angles during backswing and follow-through phases (3). He et al. employed a VICON optical system to study driving leg kinematics during topspin forehand loops. They found significant ankle movement differences between elite and intermediate players, recommending that intermediate players enhance lower limb muscle response to improve energy transfer (7). Qian et al. compared superior and intermediate players using VICON, finding that superior players exhibited greater hip flexion and knee external rotation at stroke initiation, plus increased hip internal rotation and extension at stroke completion (1). However, these optical systems require controlled environments, expensive multi-camera setups, high-frequency capabilities, time-consuming marker placement, large laboratories, and skilled technicians. These limitations restrict widespread use in practical training.

Researchers have adopted lightweight machine learning-based pose estimation models (e.g., OpenPose, MediaPipe Pose, PoseNet, AlphaPose, DeepLabCut, HRNet, BlazePose, EfficientPose, MoveNet) as alternatives to complex optical motion analysis systems (8–12). These models provide human body landmarks for further development (8, 11). In sports and exercise, these models analyze movements (e.g., running, jumping, squatting) to optimize technique (8, 11, 12), detect injury-prone postures, and provide real-time form feedback via mobile apps (12). They also assess team dynamics, monitor exercise quality in fitness apps, and enable remote training guidance (9). Compared to other models requiring complex configurations, MediaPipe provides easy-to-use APIs and comprehensive documentation, lowering the development barrier. Developers can quickly integrate it into existing projects (9, 10, 13). These advantages make MediaPipe Pose the preferred solution suitable for scenarios requiring real-time performance and cross-platform deployment.

While lightweight machine learning-based pose estimation models exhibit lower precision compared to high-fidelity motion capture systems like VICON (10, 13, 14), recent advances in machine learning have significantly enhanced the utilization of keypoint data from lightweight pose estimation models (10, 15). Machine learning includes classical approaches and deep learning architectures. Integrating these methods with pose estimation data creates new opportunities for automated movement quality assessment (AQA) in athletic and rehabilitative applications.

Classical machine learning approaches include Decision Trees, Random Forest, SVM, Naive Bayes, K-NN, and Linear/Logistic Regression (9). These methods address classification (16–20), regression (21–23), clustering tasks, and feature importance ranking (24). Naive Bayes offers mathematical robustness and efficiency but relies on independence assumptions (16). Decision Trees are capable of identifying contributing factors in biomechanical analyses, with applications including knee biomechanical asymmetry (24). Random Forest combines multiple trees to improve prediction accuracy and reduce overfitting (24, 25), successfully predicting joint angles and moments (21). Linear regression predicts continuous outcomes, such as running stride temporal variables and peak vertical ground reaction force (22), while logistic regression addresses classification problems, such as binary musculoskeletal disorder classification (16, 17). Linear and logistic regression can work together to identify key predictors and examine high vs. low knee abduction moments (18). Support Vector Machines (SVM) provide nonlinear classification and regression capabilities through kernel methods, making them well-suited for complex biomechanical tasks among classical machine learning approaches. Applications of SVM include predicting athlete aerobic fitness (19), analyzing running gait (20), and predicting fastball speed using kinetic and kinematic predictors (23).

These classical machine learning approaches offer significant advantages in interpretability (16, 23) and computational efficiency (18, 21), making them well-suited for clinical applications and scenarios with limited data availability. However, their dependency on manual feature engineering (26), limited capacity for processing high-dimensional data structures (27), constrained nonlinear modeling capabilities, and insufficient handling of temporal sequences (28) restrict their effectiveness in advanced analytical tasks (26). In contrast, deep learning architectures—including Convolutional Neural Networks (CNNs), Recurrent Neural Networks/Long Short-Term Memory networks (RNNs/LSTMs), and Transformers—provide superior accuracy and automation capabilities for complex spacial or temporal movement analysis. Nevertheless, these approaches require substantial computational resources and large training datasets to achieve optimal performance (29).

CNNs excel at processing visual data through hierarchical structures and have been successfully applied to performance classification and kinetic parameter prediction (27, 30). While CNNs effectively extract spatial features, they struggle with temporal sequence modeling, prompting researchers to integrate CNNs with RNNs to achieve comprehensive spatial and temporal analysis (31, 32). RNNs handle time-series prediction effectively (29) but suffer from vanishing and exploding gradient problems (33). LSTM units address these limitations as a specialized RNN component (29, 33, 34). LSTMs have predicted joint reaction forces (35), segmented jump phases (36), and modeled stress evolution in skeletal muscle tissue (33). Transformers extend RNN capabilities by overcoming LSTM’s long-range dependency limitations. Their attention mechanisms capture relationships across entire sequences simultaneously, providing superior sequence modeling and interpretability through attention weights while enabling efficient parallelization.

Recent advances in table tennis research have leveraged lightweight machine learning models and pose estimation techniques to analyze player performance. Chen et al. enhanced swing recognition through an improved OpenPose framework integrated with MobileNet v3-small and InceptionTime architectures (8). Building on pose estimation approaches, Llanos et al. developed a comprehensive assessment system using OpenPose and SVM-RBF classifiers to differentiate four key postural elements: upper body lean, knee bend, forehand/backhand strokes, and footwork patterns (11). For real-time applications, He et al. combined YOLOv5 and MediaPipe for stroke and posture assessment, implementing dynamic time warping algorithms to compare temporal sequences of joint angles (elbow-shoulder-hip) between player motions and reference techniques (12).Similarly, Huang and colleagues employed OpenPose for skeleton extraction combined with SVM for real-time swing recognition, incorporating Dynamic Time Warping (DTW) algorithms to evaluate technique through keypoint trajectory comparison and identification of suboptimal joint movements (37). Transformer-based approaches have also shown promise, with Dong and colleagues utilizing MediaPipe for human keypoint extraction and employing Transformer models for stroke recognition across six distinct stroke types (38).

Despite advances in lightweight pose estimation for table tennis, existing methods focus primarily on stroke classification without quantifying how multi-segment body kinematics influence ball impact outcomes. These approaches have not yet established measurable relationships between whole-body movement patterns and performance metrics, limiting their utility for diagnostic applications in sports training.

This study aimed to quantify kinematic relationships across body segments during forehand strokes to provide interpretable metrics for lightweight table tennis diagnostics. Using single-camera whole-body landmark detection and ball speed measurement, we employed statistical analysis to identify key kinematic factors influencing ball performance. This research bridges lightweight human motion capture with actionable biomechanical insights for practical sports training applications.

2 Methods

The research method consists of the following steps: (1) Pre-train a Support Vector Machine (SVM) model to serve as a tool for direct ball speed measurement from video footage. (2) Participants perform forehand strokes while MediaPipe captures the 3D position series of human body landmarks. (3) Calibrate the position series to compute velocity, angle, and angular velocity sequences, from which kinematic summaries are extracted. (4) Apply the pre-trained SVM model to predict ball speed based on the ball trajectory. (5) Conduct statistical analysis to examine correlations between kinematic summaries and ball velocity, as well as relationships between participant demographics (age and height) and ball speed.

2.1 SVM ball speed model

A Support Vector Machine (SVM) regression model was pre-trained on ball coordinates (x and y values) extracted from video frames to predict ball speed. This model was then employed in the main experiment to streamline the measurement process, ensuring efficiency for practical applications.

Ball speed specifically denotes average horizontal ball speed, which critically measures offensive performance in table tennis. The table surface was divided into $10 cm \times 10 cm$ squares using fine lines to accurately identify ball landing spots and determine horizontal flight distance. A centrally positioned audio recorder captured sounds from ball-racket and ball-table contact. These sound waves were analyzed using Adobe Audition, which provides 1 millisecond temporal resolution, enabling precise measurement of impact time intervals. Corrections were applied based on sound velocity at 20 $^{\circ}$ C (343 m/s) to account for varying sound travel times caused by different distances between impact points, drop points, and the recorder. Ball speed was calculated by dividing horizontal flight distance by travel time.

The SVM regression model was developed using 517 preliminary stroke measurements from standardized ball trajectories at controlled speeds. Manually annotated ( $x$ , $y$ ) coordinates from six consecutive video frames created 12-dimensional feature vectors capturing ball movement immediately before screen exit.

Using six-frame coordinates with SVM regression rather than simple two-point measurement offers several advantages: (1) Table tennis ball trajectories are inherently nonlinear due to air resistance, spin effects, and gravitational forces. SVM’s ability to find optimal decision boundaries in high-dimensional space makes it particularly suitable for processing multi-frame coordinate data and learning complex velocity patterns that simple kinematic equations cannot capture. (2) Six frames provide finer temporal sampling of ball motion, effectively filtering random noise and measurement errors common in single-frame coordinate detection. (3) If ball detection fails in one or two frames, SVM can still make accurate predictions using remaining frames, whereas two-point methods would fail completely.

The 517 preliminary measurements were split into training (80%) and test (20%) sets, with the test set remaining independent throughout the training process. Hyperparameter optimization employed 5-fold cross-validation with grid search across $C$ parameters [0.1, 1, 10, 100, 1,000, 10,000, 100,000] and $γ$ values [1, 0.1, 0.01, 0.001, 0.0001, 0.00001]. This strategy ensured robust parameter selection while preventing overfitting. The optimal configuration achieved $C = 100,000$ and $γ = 0.001$ , yielding best cross-validation score (R $^{2} = 0.987$ ) and test set R $^{2} = 0.981$ (Figure 1).

Figure 1

Scatter plot showing SVM predicted ball speed versus acoustic measured ball speed, both in kilometers per hour. Data points are clustered near the y=x line. The correlation coefficient is 0.981, and the root mean square error is 1.398 km/h.

Figure 1. Validation of SVM ball speed prediction model against acoustic measurements with high accuracy.

Bland–Altman analysis demonstrated excellent agreement between predicted and measured speeds in the test set. Most data points fell within 95% confidence limits with minimal systematic bias of 0.02 km/h (Figure 2). This validated SVM model was employed for lightweight ball speed prediction.

Figure 2

Bland-Altman plot showing the difference in speeds between two measurement methods (y-axis) against their mean (x-axis) in kilometers per hour. Data points are scattered around the mean difference line at 0.02 km/h. Upper and lower limits of agreement are depicted with dashed lines at +2.76 and -2.72 km/h, respectively, with shaded areas indicating ±1.96 standard deviations.

Figure 2. Bland–Altman analysis of predicted vs. measured ball speeds. Dotted lines indicate limits of agreement ( $\pm 1.96$ SD), with shaded areas representing 95% confidence intervals for the mean difference and limits of agreement.

2.2 Participants and protocol

This study included 34 female table tennis players (aged 9.1–21.7 years) from provincial teams training at Nanjing Sport Institute in February 2023. Twenty-six were from the Jiangsu team and eight from the Shanghai team. (Table 1). Power analysis using PASS 2023 demonstrated adequate statistical power for within-subject correlations (93%, ICC $=$ 0.5) and between-subject correlations (87%, r $=$ 0.5) at $α =$ 0.05. All participants were physically fit with no training contraindications. The Ethics Committee at Nanjing Sport Institute approved these procedures (Approval No. RT-2023-02). Participants or their guardians provided written informed consent before the study began.

Table 1

Table 1. Participant characteristics, means $\pm$ SD or N (%).

An experienced coach delivered balls at slow speed (approximately 24 km/h) to allow players adequate physical and mental preparation based on her movements and optimal shot timing. Right-handed players positioned themselves on the table’s left side and executed forehand strokes toward the opposite corner, maintaining ball trajectory approximately 20 cm above the table surface. A single camera (SONY FDR-AX700) positioned 95 cm above ground level captured body movements and ball trajectories from a $45^{\circ}$ angle on the right front (Figure 3). The camera operated at $1920 \times 1080$ pixel resolution and 100 fps frame rate. The experimental setup was mirrored across the net for left-handed players: participants stood on the table’s right side, and the camera position was correspondingly adjusted. The first ten valid strokes per player–excluding edge and net contacts–were recorded, resulting in a total of 340 strokes across 34 participants. Subsequent processing removed 20 strokes due to ball speed outliers, leaving 320 strokes (8–10 per player). The outlier was identified using the criterion of values exceeding $Q 3 + 1.5 \times I Q R$ or falling below $Q 1 - 1.5 \times I Q R$ , where $Q 1$ and $Q 3$ represent first and third quartiles, respectively, and $I Q R$ denotes the interquartile range, reflecting the spread within the middle 50% of data. Finally, for correlation analysis, only the fastest and slowest two strokes per player (68 strokes in total) were selected.

Figure 3

Diagram illustrating a post-stroke and serve trajectory with a standing position marked by footprints. Measurements are labeled as 2.2 meters and 0.6 meters along the x, y, and z axes. A camera location is indicated by a diamond shape.

Figure 3. Camera positioning and coordinate system for right-handed players. The MediaPipe coordinate system was modified such that the x and z axes lie parallel to the floor plane and perpendicular to each other, while the y-axis points upward, perpendicular to the transverse plane.

2.3 Landmark tracking and data processing

2.3.1 Human landmark tracking

We tracked thirty-three landmark positions for each player using MediaPipe Pose (Version 0.10.14), as defined by Bazarevsky et al. (39) (Figure 4). Each landmark was represented by three-dimensional coordinates ( $x$ , $y$ , $z$ ), where x and y indicate relative position in the 2D image, and z reflects regressed depth value. Data were stored as time series sampled at 100 Hz, matching the video frame rate. Left-handed players’ video frames were horizontally flipped before MediaPipe tracking to maintain consistency with right-handed movement patterns.

Figure 4

Diagram of a human figure with numbered joints and body parts, resembling a skeleton used in pose estimation. Numbers are labeled corresponding to body parts such as nose, eyes, ears, shoulders, elbows, wrists, hands, hips, knees, ankles, heels, and foot indices.

Figure 4. MediaPipe landmarks.

2.3.2 Ball trajectory and impact timing annotation

To ensure the progression of the main experiment, this study employed manual annotation to identify ball flight trajectories and racket-ball impact timing. Ball trajectories were determined by marking the coordinates of the ball in the final six frames before it exited the video frame boundaries. These trajectory data were subsequently used to predict ball speed using the pre-trained Support Vector Machine (SVM) model. Meanwhile, the manually identified racket-ball impact timing served as temporal reference points to extrac forward swing phase.

2.3.3 Position data filtering

Landmark position time series were filtered using a low-pass Finite Impulse Response (FIR) filter designed with a Hamming window and 31 taps, implemented through the $s c i p y . s i g n a l . f i r w i n$ function. The filter applied 100 Hz sampling rate and 0.08 normalized cutoff frequency, corresponding to 4 Hz actual cutoff.

2.3.4 Position data scaling

Two scaling factors converted landmark coordinates from video frames to actual positions. The first factor, calculated solely for the initial frame, addresses disparity between MediaPipe-estimated body dimensions and actual measurements. It compares cumulative lengths of bilateral shoulder-hip-knee-ankle segments calculated by MediaPipe with manually measured lengths. The second factor, computed for every subsequent frame, compensates for apparent size variations due to body movement, maintaining proportional consistency across frames. This factor adjusts landmark positions by comparing cumulative segment lengths in the current frame to those in the first frame. Each landmark position is then multiplied by both factors to obtain real-world coordinates.

2.3.5 Dynamic origin calibration

MediaPipe’s coordinate system is based on the camera coordinate system, which originally placed its origin $(0, 0, 0)$ between the hips, with the x-axis extending rightward, y-axis downward, and z-axis toward the camera. We still use the camera coordinate system, but repositioned the origin to the time-averaged midpoint between both feet across all frames during the current forehand stroke and redefined the y-axis to extend upward (Figure 3). Landmark coordinates were then transformed relative to this motion-adaptive origin.

2.3.6 Kinematic parameter calculation

Landmark velocities were computed from position time series using the central difference method. Velocity components in x-, y-, and z-directions were calculated from Equation 1:

v_{x} = \frac{x_{t + 1} - x_{t - 1}}{2 Δ t}, v_{y} = \frac{y_{t + 1} - y_{t - 1}}{2 Δ t}, v_{z} = \frac{z_{t + 1} - z_{t - 1}}{2 Δ t}, (1)

where $x$ , $y$ , and $z$ represent each landmark’s camera coordinates.

Body segments, defined as anatomical regions between two landmarks, were analyzed for orientation angles in the xy-, yz-, and zx-planes. Angles were computed from Equation 2:

α = \arctan (\frac{y}{x}), β = \arctan (\frac{z}{y}), γ = \arctan (\frac{x}{z}), (2)

where $x$ , $y$ , and $z$ denote landmark’s camera coordinates. The $x y$ -plane is perpendicular to the camera’s optical axis, the $y z$ -plane is parallel to the camera’s optical axis, and the $z x$ -plane is parallel to the floor.

Angular velocities in each plane ( $ω_{x y}$ , $ω_{y z}$ , $ω_{z x}$ ) were derived using Equation 3:

ω_{x y} = \frac{α_{t + 1} - α_{t - 1}}{2 Δ t}, ω_{y z} = \frac{β_{t + 1} - β_{t - 1}}{2 Δ t}, ω_{z x} = \frac{γ_{t + 1} - γ_{t - 1}}{2 Δ t} . (3)

Joint angles were defined as angles between two adjacent body segments, determined by three landmarks (e.g., points A, B, and C, with B at the joint). Joint angle $θ$ was calculated from vectors $\vec{B A}$ and $\vec{B C}$ using Equation 4:

\cos θ = \frac{\vec{B A} \cdot \vec{B C}}{‖ \vec{B A} ‖ \cdot ‖ \vec{B C} ‖}, (4)

with corresponding angular velocity computed from Equation 5:

ω = \frac{θ_{t + 1} - θ_{t - 1}}{2 Δ t} . (5)

2.3.7 Forward swing phase extraction

A forehand stroke consists of three distinct phases: backswing, forward swing, and follow-through. This study focuses exclusively on the forward swing phase, defined as the interval during which players maximally accelerate the racket toward the ball to achieve precise contact. Forward swing duration was extracted from a duplicate playing-side wrist (Landmark 16) resultant velocity time series. The series was processed using the previously applied 31st-order FIR filter again, and its second derivative was calculated to identify inflection points. The inflection point immediately before ball impact marked phase start time, while the inflection point directly after impact defined its end time. All landmark position and velocity time-series data were then truncated based on calculated boundaries. Taking the playing-side wrist (Landmark 16) as an example, Figure 5 displays its position and velocity time series, with the shaded portion representing the truncated forward swing phase. The duration of the forward stroke phase across all players is $30.34 \pm 2.03$ frames or $303.4 \pm 20.3$ milliseconds.

Figure 5

Graphs depicting position and velocity series over time. The left graph shows position in meters for x, y, z axes, and R with curves in blue, green, purple, and red. The right graph displays velocity in meters per second for the same axes. Both graphs feature a time range from -60 to 60 frames with a highlighted section from -20 to 20 frames.

Figure 5. Forward swing phase in position and velocity time series. R $=$ resultant values; x, y, z $=$ components (x: rightward, y: upward, z: toward camera). All series are time-aligned to ball impact (vertical dashed line). Shaded area indicates the mathematically extracted forward swing phase.

We compared the mathematically derived start and end points of the forward swing phase with manually annotated timepoints obtained through frame-by-frame expert analysis (n $=$ 320). The validation results demonstrate excellent agreement between the two methods. Bland–Altman analysis showed that $> 95 %$ of measurements fell within the limits of agreement for both start and end time detection. Both timepoint detections exhibited excellent reliability (ICC: 0.968 and 0.987) and strong correlations with manual annotation (r: 0.974 and 0.988, both $p < 0.001$ ). The mathematical method showed minimal systematic bias with mean differences of $- 1.016$ ms for start time and $- 0.278$ ms for end time. Mean relative errors were low at 3.69% and 1.46% respectively (Table 2).

Table 2

Table 2. Comparison of mathematical method and manual measurement for forward swing phase timing detection.

2.4 Data validation

2.4.1 Repeated measurement reliability

To compare the consistency and similarity of repeated position time series measurements, all position time series were truncated to a uniform duration of 0.8 s, which was slightly shorter than the minimum stroke length observed across all participants. Two strokes were randomly selected per player. Three metrics were employed for consistency and similarity validation: the intraclass correlation coefficient (ICC), normalized Dynamic Time Warping (DTW) similarity, and cosine similarity (CS). For each participant, these three similarity measures were computed for their stroke pairs, and overall similarity values were calculated as the mean across all participants.

The ICC measures reliability and consistency between repeated measurements, calculated as: $I C C = \frac{M S_{B} - M S_{W}}{M S_{B} + (k - 1) M S_{W}}$ where $M S_{B}$ represents between-subjects mean square, $M S_{W}$ represents within-subjects mean square, and $k$ represents the number of measurements per subject.

Normalized DTW similarity captures temporal alignment between sequences with potential time warping, computed as: $S_{D T W} = \frac{1}{1 + \frac{D T W (X, Y)}{L_{a v g} \cdot R_{d a t a}}}$ where $L_{a v g}$ represents average sequence length and $R_{d a t a}$ denotes data range.

Cosine similarity measures angular similarity between vectors, defined as: $S_{c o s} = 1 - \frac{X \cdot Y}{‖ X ‖ \cdot ‖ Y ‖}$ .

Table 3 demonstrate strong measurement consistency across different similarity metrics. Most landmarks achieved excellent reliability ( $⩾ 0.90$ ) in multiple dimensions, confirming the robustness of the motion capture model and the consistency of skilled athletic performance.

Table 3

Table 3. Consistency and similarity measures for repeated landmark position measurements: intraclass correlation coefficients (ICC), normalized DTW similarity, and cosine similarity.

Figure 6 demonstrates a single player’s position trajectories across multiple trials, revealing high consistency, particularly in the dominant-side wrist’s x and y coordinates, while the z-coordinate shows greater variability. This consistent performance across trials reflects both the movement reliability of professionally trained players and MediaPipe’s consistency and similarity in repeated assessments.

Figure 6

Graphs show the movement of right and left wrists in three dimensions over time. Each subplot has colored lines representing different trials. The x-axis indicates time in frames per one hundredth of a second, and the y-axis shows position in meters. Vertical dashed lines mark ball-racket impact time point.

Figure 6. Wrist position time series for a single participant across multiple trials. x, y, z $=$ position components (x: rightward, y: upward, z: toward the camera), and $R = \sqrt{x^{2} + y^{2} + z^{2}}$ . Lines of different colors represent the eight trials. All time series are aligned to ball impact (vertical dashed line).

2.4.2 Left-handed data validation

Left-handed video frames were horizontally flipped before MediaPipe processing to generate coordinate data comparable to right-handed players. Landmark position curve analysis employed multiple statistical approaches to assess group differences. Curves were preprocessed through cubic spline interpolation to achieve uniform temporal resolution. Eighteen kinematic features were extracted, including basic statistics (mean, standard deviation, maximum, minimum, range, median), distribution characteristics (skewness, kurtosis), peak properties (peak count, peak maximum, peak mean), temporal dynamics (mean velocity, maximum acceleration, zero crossings), spectral properties (dominant frequency, spectral centroid), and curve morphology (curve length, area under curve). Statistical comparisons utilized appropriate parametric or non-parametric tests based on normality and variance assessments. Bonferroni correction addressed multiple comparison issues. Multivariate analysis included dimensionality reduction and clustering for pattern recognition. Curve similarity was quantified using distance matrices comparing intra-group vs. inter-group variations.

Results demonstrated no significant differences in resultant position curves for all 33 landmarks between left-handed and right-handed players (corrected $p > 0.05$ ), suggesting both groups originated from the same population. This finding validates the comparability of left-handed and right-handed player data. Figure 7 illustrates the consistent resultant position curves between groups for playing-side wrist, elbow, shoulder, and contralateral wrist, elbow, shoulder landmarks.

Figure 7

Six line graphs show limb position changes over time for playing-side and contralateral wrists, elbows, and shoulders. The vertical axis represents position in meters, and the horizontal represents time in frames (1/100s). Red lines indicate left-handed individuals; blue lines indicate right-handed individuals.

Figure 7. Average resultant position curves for left-handed (n $=$ 8, coral pink) vs. right-handed (n $=$ 26, turquoise) players, aligned to ball impact (dashed line).

2.5 Statistical analysis

The fastest and slowest strokes from each player were paired and analyzed using two correlation approaches: within-subject and between-subject. We chose correlation analysis over machine learning approaches for several reasons: (1) it provides direct, interpretable relationships between biomechanical variables that coaches and researchers can immediately understand and apply; (2) correlation analysis is statistically appropriate and robust for small sample sizes; and (3) this exploratory approach helps identify which biomechanical factors are most relevant before developing more complex predictive models.

2.5.1 Within-subject corrilation

Within-subject correlation is a statistical method used to assess association between paired measures across multiple occasions for each individual (40). This technique examines whether an increase in one variable corresponds to change in another (41). For example, it determines whether faster wrist movement correlates with higher ball speeds.

Statistical summaries of positional and angular changes are provided in Table 4. These kinematic parameters were extracted across three levels: (1) Landmark parameters including positional range (PR), mean velocity (MV), peak velocity (PV), and impact velocity (IV), decomposed into resultant components (subscript R) and axial components (subscripts x, y, z); (2) Segment parameters including angular range (AR), mean angular velocity (MAV), peak angular velocity (PAV), and impact angular velocity (IAV), analyzed as resultant components (subscript R) and planar components (subscripts xy, yz, zx); (3) Joint parameters including joint angular range (JAR), joint mean angular velocity (JMAV), joint peak angular velocity (JPAV), and joint impact angular velocity (JIAV), treated as scalar quantities without directional decomposition. Within-subject correlations (r_ws) between kinematic parameters and ball speeds were calculated using repeated measures correlation analysis ( $P i n g o u i n . r m_c o r r$ function), following the rmcorr methodology of Bakdash and Marusich (42) (Equation 6).

r_{rm} = \sqrt{\frac{S S_{Measure}}{S S_{Measure} + S S_{Error}}} (6)

Sign of r_{rm} ( positive or negative) = Sign of β

where $S S_{Measure}$ represents the sum of squares for the measure (dependent variable), $S S_{Error}$ represents the sum of squares for error (residual variance), and $β$ represents the common regression slope coefficient estimated across all participants, estimated through analysis of covariance (ANCOVA).

Table 4

Table 4. Summary of kinematic variables during the forward swing phase.

For each player, the fastest stroke was compared with the slowest stroke to compute r_ws. This method aimed to identify key factors driving differences between fastest and slowest ball speeds.

2.5.2 Between-subject corrilation

Between-subject correlation evaluates whether individuals with higher values in one variable tend to exhibit higher values in another across subjects (41). For example, it can assess whether age or height correlates with ball speed. The between-subject correlation coefficient (r_bs) is calculated using Equation 7 proposed by Bland and Altman (41):

r = \frac{\sum m_{i} \bar{x_{i}} \bar{y_{i}} - \frac{\sum m_{i} \bar{x_{i}} \cdot \sum m_{i} \bar{y_{i}}}{\sum m_{i}}}{\sqrt{(\sum m_{i} {\bar{x_{i}}}^{2} - \frac{{(\sum m_{i} \bar{x_{i}})}^{2}}{\sum m_{i}}) (\sum m_{i} {\bar{y_{i}}}^{2} - \frac{{(\sum m_{i} \bar{y_{i}})}^{2}}{\sum m_{i}})}} (7)

where $m_{i}$ is the number of measurements for subject $i$ (stroke count for a given player), and $\bar{x_{i}}$ and $\bar{y_{i}}$ are the means of two variables across multiple measurements for subject $i$ .

3 Results

3.1 Within-subject correlation between biomechanical kinematic characteristics and ball speed

3.1.1 Relationship between positional range, linear velocity of landmarks, and ball speed

The positional range and velocities (mean, peak, and at impact) of the head, playing-side shoulder (r_ws = 0.51, 0.63, 0.61 for PR $_{x}$ , MV $_{x}$ , PV $_{x}$ ), elbow (r_ws = 0.63, 0.70, 0.69 for PR $_{R}$ , MV $_{R}$ , PV $_{R}$ ), wrist (r_ws = 0.60, 0.56, 0.50 for MV $_{R}$ , PV $_{R}$ , IV $_{R}$ ), fingers, and both hips show significant positive correlations with ball speed, particularly along the x-axis. The resultant peak and impact velocities of the legs reveal distinct patterns: the playing-side knee shows a weak positive correlation, while the contralateral knee and ankle display weak to moderate negative correlations with ball speed (Figure 8 and Table 5).

Figure 8

Diagram displaying four rows and four columns of linked skeletal models, indicating positional range, mean velocity, peak velocity, and impact velocity across resultant, x-axis, y-axis, and z-axis metrics. Nodes on the models are color-coded from red to blue, representing a scale from 0.9 to -0.9.

Figure 8. Heatmap visualization of within-subject correlation between landmark kinematics and ball speed in the camera coordinate system, where the x-axis points right, y-axis points up and z-axis points toward the camera (Figure 3). The resultant value represents the square root of the sum of squared components from the x, y, and z axes. Colored circles indicate significant correlations, with red representing positive correlations and blue representing negative correlations. Numbers indicate anatomical landmarks.

Table 5

Table 5. Within-subject correlation coefficients (r_ws) between landmark kinematic variables (positional range and velocities) and ball speed.

3.1.2 Relationship between angular range, angular velocity of body segments, and ball speed

The resultant angular range and mean angular velocity of the shoulder line (11–12) (r_ws = 0.57, 0.54 for MAV $_{R}$ , MAV $_{zx}$ ) and hip line (23–24) (r_ws = 0.51, 0.59 for AR $_{R}$ , MAV $_{zx}$ ) show moderate correlation with ball speed. The playing-side upper arm (12–14) demonstrates significant positive correlations in both angular range and angular velocities with ball speed (r_ws = 0.65, 0.71, 0.70 for AR $_{xy}$ , MAV $_{xy}$ , PAV $_{xy}$ ), while the contralateral upper arm (11–13) shows no significant relationship. In the lower limbs, the playing-side segments (24–26, 26–28) present weak to moderately positive correlations, whereas the contralateral segments (23–25, 25–27) display weak to moderate negative correlations (Figure 9, Table 6).

Figure 9

Comparison of angular metrics across different planes in a heatmap figure diagram. Four rows represent different angular measurements: angular range, mean angular velocity, peak angular velocity, and impact angular velocity. Columns depict comparisons across resultant, xy-plane, yz-plane, and zx-plane. Each stick figure is color-coded from dark red to dark blue, indicating correlation coefficient values from -0.9 to 0.9, as shown in the color bar on the right. Various numbers near joints illustrate body landmarks.

Figure 9. Heatmap visualization of within-subject correlation between segment kinematics and ball speed. The human skeleton model displays correlations for different kinematic components across three planar projections (xy, yz, and zx-plane) in the camera coordinate system, where the x-axis points right, y-axis points up, and z-axis points toward the camera (Figure 3). The resultant value represents the combined magnitude of these components. Colored lines indicate significant correlations, with red representing positive correlations and blue representing negative correlations. Numbers indicate anatomical landmarks.

Table 6

Table 6. Within-subject correlation coefficients (r_ws) between segment kinematic variables (angular range and velocities) and ball speed.

3.1.3 Relationship between angular range, angular velocity of joint, and ball speed

Negative correlations with ball speed are observed for the contralateral shoulder’s horizontal flexion-extension (r_ws = −0.44, −0.62 for joint JAR and JPAV), the playing-side elbow’s flexion-extension (r_ws = −0.35 for joint JPAV), and both knees’ flexion-extension (Figure 10, Table 7).

Figure 10

Illustrations show human-shaped figures representing angular range, mean angular velocity, peak angular velocity, and impact angular velocity. Each figure is divided into sections with numbers and colored segments indicating different data values. A color scale on the right ranges from negative to positive values, depicted in shades from blue to red.

Figure 10. Heatmap visualization of within-subject correlation between joint kinematics and ball speed. The “J” prefix in parameter abbreviations denotes joint-related measurements. Each joint is defined by three landmarks, with the middle landmark representing the joint position. Joints and their adjacent segments are color-coded, with red representing positive correlations and blue representing negative correlations. Numbers indicate anatomical landmarks.

Table 7

Table 7. Within-subject correlation coefficients (r_ws) between joint kinematic variables (angular range and velocities) and ball speed.

3.2 Between-subject correlation coefficients of age and height with ball speed

Ball speed correlates significantly with age (r_bs = 0.62), particularly in younger participants (9.1–14.3 years, r_bs = 0.68). Height also shows positive correlations with ball speed. However, among female adolescents (14.3–21.7 years), age shows weak correlation (r_bs = 0.17) while height shows slight negative correlation (r_bs = −0.29) with ball speed (Table 8, Figure 11).

Table 8

Table 8. Between-subject correlation matrix showing relationships among age, height, and ball speed across age groups.

Figure 11

Scatter plot showing ball speed in kilometers per hour versus age in years. Data points have error bars. A dashed vertical line at 14.3 years divides the x-axis. A table indicates the Spearman correlation coefficient (\$ r_bs \$) for different age groups: Total (0.62), 9.1 to 14.3 years (0.68), and 14.3 to 21.7 years (0.17).

Figure 11. Ball speed distribution by age group. Black dots represent individual player means, colored lines show 95% confidence intervals, and r_bs denotes the between-subject correlation coefficient.

4 Discussion

4.1 MediaPipe-based motion capture reveals critical kinematic parameters

Our MediaPipe-based analysis revealed specific kinematic metrics that correlate with ball speed in table tennis forehand strokes. Ball speed increased with greater playing-side arm linear movement at the shoulder, elbow and wrist (Figure 8 and Table 5), as well as with enhanced rotational motion at the playing-side upper arm, shoulder line, and hip line (Figure 9 and Table 6). Conversely, ball speed decreased with excessive contralateral shoulder horizontal flexion/extension and playing-side elbow flexion-extension (Figure 10 and Table 7). These features, derived from 33 landmarks, 19 inter-keypoint segments, and 12 joint angles, comprehensively characterize forward stroke mechanics in table tennis. They reveal biomechanical principles for optimizing body segment activation to achieve peak ball speed.

Our findings align with prior studies (43, 44), confirming that playing-side arm linear velocity and positional range directly enhance ball speed (Figure 8). Racket speed, the direct determinant of ball speed, originates from the upper limb’s kinetic chain through sequential joint velocity propagation from shoulder to wrist (4, 45). The playing-side shoulder serves as the proximal driver, generating angular momentum that transmits distally to the elbow and wrist (45). These results support previous evidence linking playing-side shoulder motion to racket speed (4, 46, 47) (Figure 10, Table 7).

The contralateral shoulder showed negative correlations between horizontal flexion-extension range and velocity and ball speed. This suggests that minimizing non-playing-side arm motion relative to the torso improves stroke efficiency. Stabilizing the contralateral shoulder through scapular muscles anchors the upper arm during forehand strokes, enhancing whole-body power transfer and movement consistency.

Researchers disagree about elbow angular velocity. Xiao et al. reported positive correlations between elbow angular velocity and ball speed (44), while Zheng et al. found no significant correlation between playing-side elbow angular velocity and ball speed (43). Chen et al. found that elite players had smaller elbow flexion angles but greater elbow flexion angular velocities at impact (48). We found weak negative correlations between playing-side elbow angular range, angular velocities and ball speed (r_ws = −0.35 to −0.17) (Table 7, Figure 10). This difference may result from different motion phase divisions compared to other studies. Further experiments are needed to validate these findings.

Hip motion critically influences trunk rotation, which forms the foundation of kinetic chain initiation. Racket speed at impact was related to the hip axial rotation torque at the playing side (49). While previous studies established the importance of hip kinematics (1–3, 46), our analysis provides higher-resolution evidence that hip positional range and velocity (particularly along the x-axis) and inter-hip angular dynamics in the xz-plane positively correlate with ball speed (Figures 8, 9).

Force transmission begins with lower limb engagement, where playing-side leg activity (positive correlations) contrasts with contralateral leg stabilization (negative correlations) (Figures 8, 9). During forward swing, weight shifts toward the playing-side leg, positioning it closer to the rotational axis to bear load, while the contralateral leg balances and stabilizes rotation. Knee flexion-extension range and velocity negatively correlated with ball speed (Figure 10), indicating that minimizing knee movement during forward swing helps maintain efficient trunk rotation. Excessive knee motion appears to compromise this rotation, likely by introducing unnecessary vertical displacement that disrupts kinetic transfer.

Previous table tennis kinematic studies used keypoint positions and linear velocities (43), body segment angles and angular velocities (1, 3, 49, 50), and joint angles and angular velocities (1–3, 48, 50). These metrics included mean values (2), peak values (2, 48), and kinematic or racket movement characteristics at impact (2, 43, 48, 49). We comprehensively applied these metrics within the MediaPipe lightweight framework and provided more intuitive analysis of these kinematic features and their relationships with ball speed. Players with extensive professional training not only generate high-speed balls but also maintain excellent body movement stability and consistency (1, 3, 49, 50). This stability is crucial for continuous, stable, high-speed striking in high-level competition.

Beyond individual performance assessment, MediaPipe-based analysis enables population-level insights into developmental trends. Between-subject correlations reveal that female players’ forehand speed increases with age and height before 14.3 years but plateaus after 14.3 years (Table 8, Figure 11). This analysis provides valuable guidance for athletes at different developmental stages. For example, young players from pre-adolescence to early adolescence should balance fundamental technical training with strength and speed development to improve ball velocity and enhance attacking capabilities. In contrast, during middle to late adolescence, players must prioritize technical, tactical, psychological, and fitness factors over reliance on physical growth to advance performance.

4.2 MediaPipe-based table tennis analysis solution

MediaPipe is an open-source framework created by Google that provides cross-platform machine learning solutions for real-time perception tasks including human pose tracking, body keypoint detection, hand tracking, facial analysis, face detection, object detection, and augmented reality applications. The framework offers superior computational efficiency with lower latency and cross-platform compatibility across Linux, macOS, Windows, Android, and iOS platforms, making it highly suitable for practical applications (12). Its vision-based approach eliminates dependency on specialized hardware, enabling flexible deployment with consumer-grade cameras while maintaining computational efficiency. The system tracks 33 anatomical landmarks across consecutive frames to model temporal kinematics of human motion, effectively balancing accuracy with low computational overhead.

Researchers investigated MediaPipe’s reliability by comparing it with widely recognized accurate optoelectronic systems (e.g., VICON and Qualisys). Hii et al. used MediaPipe 3D for gait analysis and reported good to excellent agreement across spatiotemporal parameters, with good (ICC(2,1) $> 0.75$ ) to excellent (ICC(2,1) $> 0.90$ ) agreement in all temporal gait parameters except right-to-left leg transition time (ICC(2,1) $> 0.50$ ), attributed to the very short duration (0.20 s) (51). Roggio et al. applied MediaPipe to obtain 3D joint angles (shoulder adduction, hip adduction) from 250 healthy volunteers, confirming high reliability of ML-driven posture analysis (ICC 0.67–0.95), with hip adduction showing the highest ICC (0.95) and knee valgus showing the lowest (0.67) (52). Latreche et al. compared 3D measurements with goniometer and digital inclinometer results, finding MediaPipe shoulder motion measurements all showed ICC $> 0.81$ : shoulder abduction ICC $=$ 0.968, adduction $=$ 0.99, extension $=$ 0.99, flexion $=$ 0.992, indicating excellent reliability. Mean differences were $- {0.01}^{\circ}$ compared to goniometer and $- {0.36}^{\circ}$ compared to digital inclinometer, with 95% limits of agreement confirming good validity (53).

Despite questions about MediaPipe 3D measurement accuracy, particularly at specific angles or during occlusion (13), MediaPipe 2D measurements have proven accurate and reliable (10, 54). Hamilton et al. compared MediaPipe 2D joint angles and range of motion with 3D motion capture systems (Qualisys), finding mean CV below 10% and CC $=$ 0.95, demonstrating MediaPipe 2D accuracy (54). Some researchers compute 3D coordinates through post-processing of 2D measurements using multiple cameras (14, 55). Ceriola et al. used two cameras to acquire 2D keypoints and estimated 3D coordinates through stereo triangulation, reporting minimum absolute errors of ( ${3.1}^{\circ} \pm {1.8}^{\circ}$ ) and ( ${3.5}^{\circ} \pm {1.9}^{\circ}$ ) for hip joints and ( ${4.0}^{\circ} \pm {3.7}^{\circ}$ ) and ( ${4.8}^{\circ} \pm {4.3}^{\circ}$ ) for knee joints (55). We did not map MediaPipe’s camera-based 3D coordinates to anatomical coordinate systems but preserved the original coordinates. This approach retains MediaPipe’s relatively accurate x and y values, while z-axis depth variations do not affect accuracy in the plane perpendicular to the camera axis (xy-plane).

4.3 Limitations and future works

The study has several limitations. First, despite including 8–10 forehand strokes per player, only the fastest and slowest strokes were paired to calculate within-subject correlation coefficients. Elite participants exhibited highly consistent stroke patterns, leaving minimal variations in body motions and ball speeds. Measurement errors occasionally blurred speed distinctions, misclassifying fast strokes as slow and vice versa. Prioritizing extreme-speed strokes mitigated overlap effects but reduced statistical power. Two solutions could resolve this issue: (1) integrating high-speed cameras for precise measurements, albeit at the cost of practicality, or (2) recruiting lower-skilled players, who inherently display broader ball speed variations. Future work will refine the ball speed measurement model for higher precision, expand the participant pool to include diverse skill levels.

Second, this study recruited female provincial athletes, which limits the generalizability of findings to other populations. For example, male athletes may display different kinematic characteristics due to variations in movement patterns and skill levels. Future research should include mixed-gender cohorts or develop population-specific feature models for different demographics, including gender, age, and training level.

Real-time systems offer greater value for technical diagnosis. However, implementing real-time solutions requires addressing several technical challenges: (1) Action segmentation: Deep learning models must classify continuous time-series data into discrete stroke types (e.g., forehand strokes, backhand strokes, forehand chops, and backhand chops). (2) Success/failure classification: The system must distinguish successful shots from faults by analyzing ball trajectories and automatically detecting net contacts or boundary violations. (3) Automated ball speed measurement: Current ball trajectory calibration relies on manually annotated video coordinates. Real-time automation requires machine learning approaches, such as Ji et al.’s framework (56), which integrates VOCUS-based image segmentation, LGP+Adaboost classification for smear detection, and dynamic ROI optimization to address environmental noise, motion blur, and computational delays. (4) Accurate racket-ball impact timing is essential for movement phase segmentation. Machine learning models must automatically detect trajectory discontinuities to precisely calibrate impact moments based on ball flight path changes. (5) Player movement tracking: Players move rapidly during rallies, causing partial occlusion or frame exit. Wide-angle lenses expand the field of view, while advanced deep learning algorithms can reduce occlusion effects.

5 Conclusions

This study scanned 33 skeletal landmarks, 19 segments, and 12 joints using MediaPipe to identify kinematic features linked to ball speed in table tennis forehand strokes. These features may enable lightweight technical evaluation. Ball speed increased with greater playing-side arm linear movement at the shoulder, elbow and wrist, as well as with enhanced rotational motion at the playing-side upper arm, shoulder line, and hip line. Conversely, ball speed decreased with excessive contralateral shoulder horizontal flexion/extension and playing-side elbow flexion-extension. These kinematic patterns comprehensively characterize forward stroke mechanics, providing critical metrics for technical assessment and improvement. MediaPipe demonstrated robust performance, showing high consistency during repetitive motions. Its low-cost, cross-platform compatibility, high computational efficiency, minimal hardware dependency, and open-source nature position it as a promising tool for real-time biomechanical analysis in table tennis training systems.

Data availability statement

The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: https://doi.org/10.6084/m9.figshare.28881086.

Ethics statement

The studies involving humans were approved by The Ethics Committee of Nanjing Sport Institute. The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation in this study was provided by the participants’ legal guardians/next of kin.

Author contributions

YL: Writing – original draft, Formal analysis, Software, Project administration, Writing – review & editing, Methodology, Validation, Conceptualization, Supervision, Investigation, Data curation. XD: Formal analysis, Methodology, Resources, Writing – review & editing, Investigation. CY: Writing – review & editing, Data curation, Methodology, Resources, Formal analysis, Investigation. QY: Conceptualization, Project administration, Supervision, Formal analysis, Writing – review & editing, Methodology, Validation, Software, Investigation, Resources, Data curation.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This research is supported by the Ministry of Education of Humanities and Social Science project of China (Grant No. 19YJA890032).

Acknowledgments

We express our sincere gratitude to the female athletes who participated in this study, and to Coach Fengyu Jin and Coach Haijin He for their invaluable support of this research.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declare that no Generative AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Qian J, Zhang Y, Baker JS, Gu Y. Effects of performance level on lower limb kinematics during table tennis forehand loop. Acta Bioeng Biomech. (2016) 18:149–55. doi: 10.5277/ABB-00492-2015-03

PubMed Abstract | Crossref Full Text | Google Scholar

2. Iino Y, Kojima T. Mechanical energy generation and transfer in the racket arm during table tennis topspin backhands. Sports Biomech. (2016) 15:180–97. doi: 10.1080/14763141.2016.1159722

PubMed Abstract | Crossref Full Text | Google Scholar

3. Wang M, Fu L, Gu Y, Mei Q, Fu F, Fernandez J. Comparative study of kinematics and muscle activity between elite and amateur table tennis players during topspin loop against backspin movements. J Hum Kinet. (2018) 64:25–33. doi: 10.1515/hukin-2017-0182

PubMed Abstract | Crossref Full Text | Google Scholar

4. Iino Y, Kojima T. Kinetics of the upper limb during table tennis topspin forehands in advanced and intermediate players. Sports Biomech. (2011) 10:361–77. doi: 10.1080/14763141.2011.629304

PubMed Abstract | Crossref Full Text | Google Scholar

5. Zhu R, Yang X, Chong LC, Shao S, István B, Gu Y. Biomechanics of topspin forehand loop in table tennis: an application of opensim musculoskeletal modelling. Healthcare. (2023) 11:1216. doi: 10.3390/healthcare11091216

PubMed Abstract | Crossref Full Text | Google Scholar

6. Tabrizi SS, Pashazadeh S, Javani V. Data acquired by a single object sensor for the detection and quality evaluation of table tennis forehand strokes. Data Brief. (2020) 33:106504. doi: 10.1016/j.dib.2020.106504

PubMed Abstract | Crossref Full Text | Google Scholar

7. He Y, Lyu X, Sun D, Baker JS, Gu Y. The kinematic analysis of the lower limb during topspin forehand loop between different level table tennis athletes. PeerJ. (2021) 9:e10841. doi: 10.7717/peerj.10841

PubMed Abstract | Crossref Full Text | Google Scholar

8. Chen P, Shen Q. Research on table tennis swing recognition based on lightweight openpose. In: 2023 16th International Conference on Advanced Computer Theory and Engineering (ICACTE) (2023). p. 207–12. doi: 10.1109/ICACTE59887.2023.10335442

Crossref Full Text | Google Scholar

9. Roggio F, Trovato B, Sortino M, Musumeci G. A comprehensive analysis of the machine learning pose estimation models used in human movement and posture analyses: a narrative review. Heliyon. (2024) 10:e39977. doi: 10.1016/j.heliyon.2024.e39977

PubMed Abstract | Crossref Full Text | Google Scholar

10. Simoes W, Reis L, Araujo C, Maia Jr J. Accuracy assessment of 2D pose estimation with MediaPipe for physiotherapy exercises. Procedia Comput Sci. (2024) 251:446–53. doi: 10.1016/j.procs.2024.11.132

Crossref Full Text | Google Scholar

11. Llanos MJ, Obrero JR, Alvarez LM, Yang CH, Aliac CJ. Computer-assisted table tennis posture analysis using machine learning. In: 2022 IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET) (2022). p. 1–6. doi: 10.1109/IICAIET55139.2022.9936806

Crossref Full Text | Google Scholar

12. He Z, Yang Z, Xu J, Chen H, Li X, Wang A, et al. Real-time accurate determination of table tennis ball and evaluation of player stroke effectiveness with computer vision-based deep learning. Appl Sci. (2025) 15:5370. doi: 10.3390/app15105370

Crossref Full Text | Google Scholar

13. Dill S, Rösch A, Rohr M, Güney G, De Witte L, Schwartz E, et al. Accuracy evaluation of 3D pose estimation with MediaPipe pose for physical exercises. Curr Direct Biomed Eng. (2023) 9:563–6. doi: 10.1515/cdbme-2023-1141

Crossref Full Text | Google Scholar

14. Dill S, Ahmadi A, Grimmer M, Haufe D, Rohr M, Zhao Y, et al. Accuracy evaluation of 3d pose reconstruction algorithms through stereo camera information fusion for physical exercises with mediapipe pose. Sensors. (2024) 24:7772. doi: 10.3390/s24237772

PubMed Abstract | Crossref Full Text | Google Scholar

15. Xu D, Zhou H, Quan W, Jiang X, Liang M, Li S, et al. A new method proposed for realizing human gait pattern recognition: inspirations for the application of sports and clinical gait analysis. Gait Posture. (2024) 107:293–305. doi: 10.1016/j.gaitpost.2023.10.019

PubMed Abstract | Crossref Full Text | Google Scholar

16. Wu C, Xu Y, Fang J, Li Q. Machine learning in biomaterials, biomechanics/mechanobiology, and biofabrication: state of the art and perspective. Arch Comput Methods Eng. (2024) 31:3699–765. doi: 10.1007/s11831-024-10100-y

Crossref Full Text | Google Scholar

17. Donisi L, Cesarelli G, Capodaglio E, Panigazzi M, D’Addio G, Cesarelli M, et al. A logistic regression model for biomechanical risk classification in lifting tasks. Diagnostics. (2022) 12:2624. doi: 10.3390/diagnostics12112624

PubMed Abstract | Crossref Full Text | Google Scholar

18. Myer GD, Ford KR, Khoury J, Succop P, Hewett TE. Biomechanics laboratory-based prediction algorithm to identify female athletes with high knee loads that increase risk of ACL injury. Br J Sports Med. (2011) 45:245–52. doi: 10.1136/bjsm.2009.069351

PubMed Abstract | Crossref Full Text | Google Scholar

19. Acikkar M, Akay MF, Ozgunen KT, Aydin K, Kurdak SS. Support vector machines for aerobic fitness prediction of athletes. Expert Syst Appl. (2009) 36:3596–602. doi: 10.1016/j.eswa.2008.02.002

Crossref Full Text | Google Scholar

20. Phinyomark A, Hettinga BA, Osis ST, Ferber R. Gender and age-related differences in bilateral lower extremity mechanics during treadmill running. PLoS One. (2014) 9:e105246. doi: 10.1371/journal.pone.0105246

PubMed Abstract | Crossref Full Text | Google Scholar

21. Dey S, Yoshida T, Ernst M, Schmalz T, Schilling AF. A random forest approach for continuous prediction of joint angles and moments during walking: an implication for controlling active knee-ankle prostheses/orthoses. In: 2019 IEEE International Conference on Cyborg and Bionic Systems (CBS) (2019). p. 66–71. doi: 10.1109/CBS46900.2019.9114439

Crossref Full Text | Google Scholar

22. Patoz A, Lussiana T, Breine B, Gindre C, Malatesta D. Comparison of different machine learning models to enhance sacral acceleration-based estimations of running stride temporal variables and peak vertical ground reaction force. Sports Biomech. (2023) 24:825–41. doi: 10.1080/14763141.2022.2159870

PubMed Abstract | Crossref Full Text | Google Scholar

23. Nicholson K, Collins G, Waterman B, Bullock G. Machine learning and statistical prediction of fastball velocity with biomechanical predictors. J Biomech. (2022) 134:110999. doi: 10.1016/j.jbiomech.2022.110999

PubMed Abstract | Crossref Full Text | Google Scholar

24. Palmieri-Smith RM, Curran MT, Garcia SA, Krishnan C. Factors that predict sagittal plane knee biomechanical symmetry after anterior cruciate ligament reconstruction: a decision tree analysis. Sports Health. (2022) 14:167–75. doi: 10.1177/19417381211004932

PubMed Abstract | Crossref Full Text | Google Scholar

25. Ghaneei M, Ekyalimpa R, Westover L, Parent EC, Adeeb S. Customized k-nearest neighbourhood analysis in the management of adolescent idiopathic scoliosis using 3D markerless asymmetry analysis. Comput Methods Biomech Biomed Eng. (2019) 22:696–705. doi: 10.1080/10255842.2019.1584795

PubMed Abstract | Crossref Full Text | Google Scholar

26. Wang Z, Chen C, Chen H, Zhou Y, Wang X, Wu X. Dual transformer network for predicting joint angles and torques from multi-channel EMG signals in the lower limbs. IEEE J Biomed Health Inform. (2025):1–13. doi: 10.1109/JBHI.2025.3555255

Crossref Full Text | Google Scholar

27. Liu Q, Mo S, Cheung VC, Cheung BM, Wang S, Chan PP, et al. Classification of runners’ performance levels with concurrent prediction of biomechanical parameters using data from inertial measurement units. J Biomech. (2020) 112:110072. doi: 10.1016/j.jbiomech.2020.110072

PubMed Abstract | Crossref Full Text | Google Scholar

28. Liew BX, Rügamer D, Zhai X, Wang Y, Morris S, Netto K. Comparing shallow, deep, and transfer learning in predicting joint moments in running. J Biomech. (2021) 129:110820. doi: 10.1016/j.jbiomech.2021.110820

PubMed Abstract | Crossref Full Text | Google Scholar

29. Carneros-Prado D, Dobrescu CC, Cabañero L, Villa L, Altamirano-Flores YV, Lopez-Nava IH, et al. Synthetic 3D full-body skeletal motion from 2D paths using RNN with LSTM cells and linear networks. Comput Biol Med. (2024) 180:108943. doi: 10.1016/j.compbiomed.2024.108943

PubMed Abstract | Crossref Full Text | Google Scholar

30. Dorschky E, Nitschke M, Martindale CF, Koelewijn AD, Eskofier BM. CNN-based estimation of sagittal plane walking and running biomechanics from measured and simulated inertial sensor data. Front Bioeng Biotechnol. (2020) 8:604. doi: 10.3389/fbioe.2020.00604

PubMed Abstract | Crossref Full Text | Google Scholar

31. Hossain MSB, Dranetz J, Choi H, Guo Z. Deepbbwae-net: a CNN-RNN based deep superlearner for estimating lower extremity sagittal plane joint kinematics using shoe-mounted IMU sensors in daily living. IEEE J Biomed Health Inform. (2022) 26:3906–17. doi: 10.1109/JBHI.2022.3165383

PubMed Abstract | Crossref Full Text | Google Scholar

32. Burenbatu B. Comparative analysis of biomechanical patterns in sprinting: a machine learning approach to optimize running performance in track athletes. Mol Cell Biomech. (2024) 21:321–. doi: 10.62617/mcb.v21i1.321

Crossref Full Text | Google Scholar

33. Ballit A, Dao TT. Recurrent neural network to predict hyperelastic constitutive behaviors of the skeletal muscle. Med Biol Eng Comput. (2022) 60:1177–85.doi: 10.1007/s11517-022-02541-z

PubMed Abstract | Crossref Full Text | Google Scholar

34. Xu D, Zhou H, Quan W, Gusztav F, Wang M, Baker JS, et al. Accurately and effectively predict the ACL force: utilizing biomechanical landing pattern before and after-fatigue. Comput Methods Programs Biomed. (2023) 241:107761. doi: 10.1016/j.cmpb.2023.107761

PubMed Abstract | Crossref Full Text | Google Scholar

35. Mubarrat ST, Chowdhury S. Convolutional LSTM: a deep learning approach to predict shoulder joint reaction forces. Comput Methods Biomech Biomed Eng. (2023) 26:65–77. doi: 10.1080/10255842.2022.2045974

PubMed Abstract | Crossref Full Text | Google Scholar

36. Lu Y, Wang H, Qi Y, Xi H. Evaluation of classification performance in human lower limb jump phases of signal correlation information and LSTM models. Biomed Signal Process Control. (2021) 64:102279. doi: 10.1016/j.bspc.2020.102279

Crossref Full Text | Google Scholar

37. Huang W, Yang J, Luo H, Zhang H. Human table tennis actions recognition and evaluation method based on skeleton extraction. In: 2023 3rd International Conference on Consumer Electronics and Computer Engineering (ICCECE) (2023). p. 7–13. doi: 10.1109/ICCECE58074.2023.10135318

Crossref Full Text | Google Scholar

38. Dong K, Yan WQ. Player performance analysis in table tennis through human action recognition. Computers. (2024) 13:332. doi: 10.3390/computers13120332

Crossref Full Text | Google Scholar

39. Bazarevsky V, Grishchenko I. Google AI Blog: on-device, real-time body pose tracking with mediapipe blazepose (2020). doi: 10.48550/arXiv.2006.10204

Crossref Full Text | Google Scholar

40. Sharma N, Sree BS, Aranha VP, Samuel AJ. Repeated measures correlation between functional capacity, pulmonary function and chest expansion in children undergoing open abdominal surgery: secondary analysis from randomized clinical trial. J Pediatr Surg. (2021) 56:2022–6. doi: 10.1016/j.jpedsurg.2020.12.006

PubMed Abstract | Crossref Full Text | Google Scholar

41. Bland JM, Altman DG. Calculating correlation coefficients with repeated observations: part 2—correlation between subjects. BMJ. (1995) 310:633. REFDOIdoi: 10.1136/bmj.310.6980.6337703752

PubMed Abstract | Google Scholar

42. Bakdash JZ, Marusich LR. Repeated measures correlation. Front Psychol. (2017) 8:456. doi: 10.3389/fpsyg.2017.00456

PubMed Abstract | Crossref Full Text | Google Scholar

43. Zheng C, Lu M, Zeng Y, Hu M, Geng X, Xiao Y. The impact of wrist joint movement on stroke effect during topspin forehand in table tennis. Int J Perform Anal Sport. (2021) 21:324–35. doi: 10.1080/24748668.2021.1885839

Crossref Full Text | Google Scholar

44. Xiao Y, Xiao Y, Lu M, Zeng Y. The correlation between stroke characteristics and stroke effect of young table tennis players. J Sports Med Phys Fitness. (2021) 61:1454–63. doi: 10.23736/S0022-4707.20.11800-0

PubMed Abstract | Crossref Full Text | Google Scholar

45. Putnam CA. Sequential motions of body segments in striking and throwing skills: descriptions and explanations. J Biomech. (1993) 26:125–35. doi: 10.1016/0021-9290(93)90084-R

PubMed Abstract | Crossref Full Text | Google Scholar

46. Iino Y, Kojima T. Kinematics of table tennis topspin forehands: effects of performance level and ball spin. J Sports Sci. (2009) 27:1311–21. doi: 10.1080/02640410903264458

PubMed Abstract | Crossref Full Text | Google Scholar

47. Bańkosz Z, Winiarski S. Correlations between angular velocities in selected joints and velocity of table tennis racket during topspin forehand and backhand. J Sports Sci Med. (2018) 17:330–8.

Google Scholar

48. Chen MZ, Wang X, Chen Q, Ma Y, Lanzoni IM, Lam WK. An analysis of whole-body kinematics, muscle strength and activity during cross-step topspin among table tennis players. Int J Perform Anal Sport. (2022) 22:16–28. doi: 10.1080/24748668.2022.2025712

Crossref Full Text | Google Scholar

49. Wong DWC, Lee WCC, Lam WK. Biomechanics of table tennis: a systematic scoping review of playing levels and maneuvers. Appl Sci. (2020) 10:5203. doi: 10.3390/app10155203

Crossref Full Text | Google Scholar

50. Yu C, Shao S, Baker JS, Awrejcewicz J, Gu Y. A comparative biomechanical analysis of the performance level on chasse step in table tennis. Int J Sports Sci Coach. (2019) 14:372–82. doi: 10.1177/1747954119843651

Crossref Full Text | Google Scholar

51. Hii CST, Gan KB, Zainal N, Ibrahim NM, Azmin S, Desa SHM, et al. Automated gait analysis based on a marker-free pose estimation model. Sensors. (2023) 23:6489. doi: 10.3390/s23146489

PubMed Abstract | Crossref Full Text | Google Scholar

52. Roggio F, Di Grande S, Cavalieri S, Falla D, Musumeci G. Biomechanical posture analysis in healthy adults with machine learning: applicability and reliability. Sensors. (2024) 24:2929. doi: 10.3390/s24092929

PubMed Abstract | Crossref Full Text | Google Scholar

53. Latreche A, Kelaiaia R, Chemori A, Kerboua A. Reliability and validity analysis of mediapipe-based measurement system for some human rehabilitation motions. Measurement. (2023) 214:112826. doi: 10.1016/j.measurement.2023.112826

Crossref Full Text | Google Scholar

54. Hamilton RI, Glavcheva-Laleva Z, Haque Milon MI, Anil Y, Williams J, Bishop P, et al. Comparison of computational pose estimation models for joint angles with 3D motion capture. J Bodyw Mov Ther. (2024) 40:315–9. doi: 10.1016/j.jbmt.2024.04.033

PubMed Abstract | Crossref Full Text | Google Scholar

55. Ceriola L, Taborri J, Donati M, Rossi S, Patanè F, Mileti I. Comparative analysis of markerless motion capture systems for measuring human kinematics. IEEE Sens J. (2024) 24:28135–44. doi: 10.1109/JSEN.2024.3431873

Crossref Full Text | Google Scholar

56. Ji YF, Zhang JW, Shi ZH, Liu MH, Ren J. Research on real-time tracking of table tennis ball based on machine learning with low-speed camera. Syst Sci Control Eng. (2018) 6:71–9. doi: 10.1080/21642583.2018.1450167

Crossref Full Text | Google Scholar

Keywords: motion analysis, MediaPipe, table tennis, forehand stroke, biomechanics

Citation: Lyu Y, Duan X, Yang C and Ye Q (2025) Development of a MediaPipe-based framework for biomechanical quantification of table tennis forehand strokes. Front. Sports Act. Living 7:1635581. doi: 10.3389/fspor.2025.1635581

Received: 26 May 2025; Accepted: 28 July 2025;
Published: 15 August 2025.

Edited by:

Lei Zhao, Shandong Jianzhu University, China

Reviewed by:

Datao Xu, Ningbo University, China
Guoxin Zhang, The Hong Kong Polytechnic University, Hong Kong SAR, China

Copyright: © 2025 Lyu, Duan, Yang and Ye. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Qiang Ye, eWVxaWFuZ0Buc2kuZWR1LmNu

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.