- 1State Key Laboratory of Female Fertility Promotion, Center for Reproductive Medicine, Department of Obstetrics and Gynecology, Peking University Third Hospital, Beijing, China
- 2National Clinical Research Center for Obstetrics and Gynecology (Peking University Third Hospital), Beijing, China
- 3Key Laboratory of Assisted Reproduction (Peking University), Ministry of Education, Beijing, China
- 4Beijing Key Laboratory of Reproductive Endocrinology and Assisted Reproductive Technology, Beijing, China
- 5National Clinical Key Specialty Construction Program, China (2023), Beijing, China
- 6Big Data Center, Beijing Children’s Hospital, Capital Medical University, National Center for Children’s Health, Beijing, China
- 7Department of Thoracic Surgery, Zhejiang Provincial People’s Hospital, Affiliated People’s Hospital, Hangzhou Medical College, Hangzhou, Zhejiang, China
Background: Women display marked variability in ovarian reserve, which is pivotal for fertility and menopausal timing. Traditional criteria, such as Bologna and Poseidon, classify women into broad groups but do not provide individualized predictions for ovarian aging or reproductive milestones. This study aims to refine the AA model (AMH + age) to enhance clinical usability, robustness, and interpretability.
Materials and Methods: Single-center retrospective ART cohort (GnRH-antagonist cycles, 2017–2021). Training: 15,241 cycles (2017–2019); Testing: 14,498 cycles (2020–2021). Poor ovarian response (POR) was defined as <5 oocytes. Three logistic-regression specifications were compared: categorical (Model-0), continuous (Model-1), and polynomial (age quadratic, AMH cubic; Model-2). Discrimination (AUC), calibration, and net reclassification improvement (NRI) were evaluated. A two-parameter logistic curve was fitted to age versus predicted POR (used population-level as “predicted DOR”) to construct an ovarian-aging trajectory and derive an interpretable “endocrine-age” index. Sensitivity analyses assessed cycle-day AMH variation; a community dataset was used to compare age-stratified AMH distributions.
Results: While all models achieved comparable discrimination (AUC ≈ 0.85), a cubic transformation model (Model-2) demonstrated superior calibration and was selected as the final algorithm. A two-parameter logistic curve allowed translation of ovarian reserve scores into an “endocrine age” and enabled individualized prediction of future milestones, such as diminished reserve with ovarian score of 50 and perimenopause, the lowest ovarian reserve score in our ART population. AMH sampling on different cycle days showed only modest effects from minor fluctuations; only substantial AMH decreases significantly affected prediction accuracy. Age-stratified AMH distributions were similar between ART and community cohorts in women <40, supporting external relevance. The updated OvaRePred (HerTempo) model is cost-effective, scalable, and operationally simple.
Conclusion: OvaRePred (HerTempo) delivers individualized, well-calibrated estimates of ovarian reserve and an interpretable endocrine-age index and future fertility milestone onset. While the tool can inform personalized fertility planning and may have broader public-health utility, the algorithm is trained on ART endpoints. Any projections of future reproductive milestones derived from the population ovarian-aging curve—and the fixed-interval hypothesis that underpins that curve—are hypothesis-generating and require prospective validation, particularly in non-ART cohorts with longitudinal follow-up.
Introduction
As is well known, women lose their fertility at menopause due to the depletion of ovarian reserve, which is widely considered the most critical factor affecting fertility (1, 2). However, what is less appreciated is the significant inter-individual variability in ovarian reserve (3, 4). Ovarian reserve, defined by the number of primordial follicles, exhibits marked differences even at birth, from thousands to millions (1). Although the rate of ovarian aging is relatively consistent among individuals, these inherent variations in ovarian reserve lead to a wide disparity in the age at menopause (1, 5). The Bologna (6) and Poseidon (7) criterias for poor ovarian response, although commonly used for assessing ovarian reserve, have notable limitations. Specifically, the Bologna standard categorizes patients into only two groups, while the Poseidon criteria, despite subdividing into four categories, still do not fully capture individual differences. Additionally, both standards heavily rely on ultrasound-based antral follicle count (AFC) assessments, which typically require transvaginal ultrasound for greater accuracy. This method, however, can cause discomfort for some women and is subject to variability due to differences in equipment and operator proficiency. Moreover, they assess ovarian reserve only at a single point in time without providing dynamic predictions of future changes. In light of these deficiencies, and given the vast individual differences in ovarian reserve, the ability to perform individualized ovarian reserve assessments and to predict reproductive milestones is of paramount importance.
In 2023, the reproductive center team at Peking University Third Hospital leveraged their extensive big data resources to introduce the OvaRePred (HerTempo) tool for ovarian reserve evaluation and future milestone prediction (8, 9). This tool has since been adopted in multiple hospitals across China, including Peking University Third Hospital, as well as in health examination centers and online platforms, with favorable patient feedback. OvaRePred (HerTempo) provides a ranking of ovarian reserve from optimal to poor based on the probability of a poor ovarian response (POR), generating a score for current ovarian reserve. Furthermore, by integrating the Fixed Interval theory—which posits that the ovarian aging curves of populations are similar and follow an S-shaped curve—our team has constructed a population-level ovarian aging curve. This enables the tool not only to assess current ovarian reserve but also to predict the age at which critical reproductive milestones occur, such as the onset of diminished ovarian reserve (DOR, corresponding to an ovarian reserve score of 50) and the beginning of perimenopause (marked by the lowest ovarian reserve score in our ART populations).
OvaRePred (HerTempo) proposes three prediction models tailored to different clinical scenarios—AAFA (AMH-AFC-FSH-Age), AFA (AMH-FSH-Age), and AA (AMH-Age)—for evaluating ovarian reserve and forecasting subsequent reproductive milestones. Although these models incorporate different combinations of variables, their predictive performance is statistically similar, suggesting that a simpler model may suffice in clinical practice. In particular, the AA model, which relies solely on anti-Müllerian hormone (AMH) and age, offers the advantages of simplicity and cost-effectiveness while maintaining high predictive accuracy (10). Moreover, AMH measurement is not time-restricted and blood can be drawn on any day of the menstrual cycle, the AA model offers greater flexibility and convenience. However, both the AFA and AAFA models depend on FSH measurements obtained via precisely timed blood draws during the menstrual cycle, and the AAFA model further requires antral follicle counts by transvaginal ultrasound—adding procedural complexity, higher costs, and greater patient discomfort. Thus, our current study focuses on updating and refining the AA model using larger datasets and advanced statistical techniques, with the ultimate goal of establishing a robust, high-performing ovarian reserve assessment and reproductive milestone prediction tool that is both user-friendly and operationally efficient.
Our original AA model (10) employed categorical transformations for its independent variables. Categorical variable models offer simplicity and ease of interpretation by grouping data to capture overall trends, maintain stability, and resist the influence of outliers (11, 12). While this method effectively captures general trends and ensures stability, it falls short in reflecting finer changes. In contrast, continuous variable models retain the complete spectrum of the original data and, through appropriate transformations (such as a cubic transformation), can more accurately capture the nonlinear relationships between variables. This leads to enhanced discriminative power and improved predictive accuracy. However, continuous models are more complex to construct and interpret, are more sensitive to outliers, and may risk overfitting, particularly with smaller sample sizes (12). For these reasons, we aim to update our AA model by exploring various transformations of the independent variables. Our goal was to achieve better discrimination and calibration compared to the original model.
Despite its clinical promise, the original AA model—which categorizes AMH and age—may be too coarse to capture subtle nonlinear relationships, and its calibration can be further improved. Meanwhile, although the AFA and AAFA models achieve similar discrimination, their reliance on cycle−timed FSH measurements and transvaginal AFC assessment increases complexity, cost, and patient burden. To address these gaps, the present study leverages a substantially expanded single−center ART dataset to explore different transformations of AMH and age, evaluate the variations of AMH on model performance, and rigorously compare model discrimination, calibration, and reclassification performance. Our aim is to refine the AA model into a streamlined, high−precision tool that maintains interpretability and operational efficiency, thereby enhancing personalized ovarian reserve assessment, endocrine age evaluation and the prediction of key reproductive milestones.
Materials and methods
Study population and data sources
This retrospective, single-center analysis enrolled women undergoing controlled ovarian stimulation with gonadotropin-releasing hormone (GnRH) antagonist protocols from 2017 to 2021. All participants received stimulation at our center, with serum AMH levels, chronological age, and pertinent clinical variables recorded at baseline. No apparent outliers in AMH or age were identified in the exported database; thus, no data exclusion for outliers was performed. Retaining the full spectrum of observed AMH and age values enhances the model’s generalizability, particularly for individuals at the extremes of ovarian reserve. Initially, 16,327 antagonist cycles from January 2017 to December 2019 constituted the training cohort, and 15,596 cycles from January 2020 to December 2021 comprised the independent validation cohort. After removing multiple cycles contributed by the same women to address the clustering issue, the final numbers of cycles included in the analysis were 15,241 for 2017–2019 and 14,498 for 2020–2021. Poor ovarian response (POR) was defined as retrieval of fewer than five oocytes. As this study aims to update our previous AA model, Table 1 reports only the distributions of AMH and age according to POR status. This study was approved by the Institutional Review Board of Peking University Third Hospital (approval number: 2015sz-017).
Clarification of POR and DOR definitions
To avoid ambiguity, we clarify the distinct definitions of poor ovarian response (POR) used in this study. Clinically, POR was defined as retrieval of fewer than five oocytes, and this criterion was applied consistently for all modeling and validation analyses. For constructing the ovarian aging curve, we defined predicted POR (or diminished ovarian reserve, DOR) using a probability threshold of 0.15, reflecting the actual incidence of POR in our cohort, rather than the standard 0.5 binary classification threshold used in the software. We recognize that these differing definitions may cause confusion; therefore, we emphasize that the clinical definition (fewer than five oocytes) is used for model evaluation, while the 0.15 threshold is employed specifically for population-level aging curve construction.
Statistical analysis
A logistic regression framework was used to model the risk of POR (< 5 oocytes retrieved), facilitating direct comparison with our previously published model (Model-0). Three candidate specifications for age and AMH were evaluated:
1. Model-1 (continuous): age and AMH were included as untransformed continuous variables.
2. Model-2 (polynomial): age was represented by a quadratic term and AMH by a cubic term, based on exploratory generalized additive modeling.
Model discrimination was quantified by the area under the receiver operating characteristic curve (AUC). Calibration was assessed via calibration plots comparing predicted probabilities with observed POR rates. Incremental improvement over Model-0 was measured using the net reclassification improvement (NRI) index.
Subsequently, the relationship between the proportion of predicted POR (also defined as predicted DOR in our study) and age was characterized. Given the anticipated sigmoidal pattern, a logistic growth curve was fitted to the age-DOR data.
To evaluate whether the AMH–age model derived from assisted reproductive technology (ART) data is generalizable to a broader population, we conducted an exploratory comparison of AMH distributions between our single-center ART cohort and a community-based epidemiologic survey13. Participants in both datasets were stratified into three age groups: ≤30 years, >30–≤40 years, and >40 years. For each age group, we plotted the empirical distribution of serum AMH concentrations and compared their medians and interquartile ranges.
All analyses were conducted in JMP Pro 17.0 (SAS Institute) and R 4.4.1, with statistical significance defined as two-sided p < 0.05.
Results
Ovarian reserve scoring construction
Prediction of poor ovarian response
This study aims to optimize the original AA model (10)—which utilizes AMH and age to predict the probability of POR—by expanding the sample size and performing a comprehensive analysis and appropriate transformation of the independent variables (AMH and age). Three distinct models were developed: the original model with categorical transformation of independent variables (Model-0), a new model without any transformation (Model-1), and a new model employing a cubic transformation of AMH and a quadratic term of age, based on exploratory generalized additive modeling. (Model-2).
Comparisons using NRI, ROC curves (Table 2), and calibration curves (Figure 1) led to the following key conclusions. While differences in ROC and NRI metrics were not markedly significant, the calibration curve of Model-2 was superior to the other models. Figure 1 compares the calibration of Model-0, Model-1, Model-2, and the Ideal model across both the training (2017–2019) and test (2020–2021) datasets. Model-2 consistently performs the best, with its curve closely aligning with the Ideal model, especially in the mid and low probability ranges. Although Model-1 also performs well, Model-0 shows significant deviations compared to the other models. The lower panel zooms in on the 0-0.25 probability range, which is particularly important as it represents 82.3% of the female population in our dataset. In this zoomed-in view, Model-2 again outperforms the other models, demonstrating superior calibration and making it the most reliable for predicting outcomes in this most common probability range.

Figure 1. Calibration of POR prediction models. (A) Training cohort (2017–2019); (B) Test cohort (2020–2021). Upper panels cover the full 0–1.0 range; lower panels zoom to 0–0.25. Dashed black = ideal. Model−0 (green) overestimates risk < 0.60; Model−1 (purple) improves fit; Model−2 (red) tracks the ideal line most closely.
Based on these findings, we have chosen the Model-2 as the final version of the updated model. In clinical practice, model calibration is of paramount importance. Good calibration ensures that the predicted incidence of POR accurately reflects the true incidence within the population, thereby guaranteeing that each predicted value closely approximates the actual occurrence rate. This accurate incidence estimation serves as a crucial foundation for subsequent predictions, such as the age at the onset of perimenopause.
Ovarian reserve score conversion
In the OvaRePred (HerTempo) model, we convert the predicted probability of POR into an ovarian reserve score using the formula: (1 – predicted probability of POR) × 100. This conversion inversely maps the risk of POR to the quanlity of ovarian reserve: a higher score indicates a lower risk of POR and a better ovarian reserve, whereas a lower score suggests a higher risk of POR and a poorer ovarian reserve. This transformation not only standardizes the expression of the predicted probabilities but also makes the evaluation results more intuitive, thereby aiding clinicians in risk communication and decision-making.
Ovarian aging curve and determination of endocrine age
In our previous work, we constructed an ovarian aging curve using cross−sectional data based on the fixed−interval theory (2, 5), achieving an r² of 0.978 (8). In this study, we aim to further optimize that ovarian aging curve by incorporating more data and exploring additional methods. First, we recognize that ovarian aging follows a sigmoidal (S−shaped) trajectory. We therefore applied a two−parameter logistic curve to depict the trend in the predicted prevalence of POR across ages. DOR, predicted POR, was defined based on the newly developed Model-2, which applies a cubic transformation to AMH. Using an actual POR incidence rate of 0.15 as the cut-off to define POR, we plotted the proportion of DOR against age, as shown in Figure 2. The logistic curve model achieved an r² of 0.989—an improvement over the original 0.978—demonstrating superior overall fit. However, both the youngest and oldest age ranges show poorer alignment with the fitted curve, likely because fewer participants occupy these extremes, reducing the sample’s representativeness and statistical power there. In addition, increased biological variability and potential outliers at the tails can further weaken the model’s ability to capture true trends in those regions.

Figure 2. Age−related trajectory of female reproductive decline. (A) Fixed Interval Theory, from an early 20th−century cross−sectional study without birth control, proposes that age intervals between fertility states are relatively fixed (2, 5). We reconstructed its conceptual schematic by redrawing from the original reference, mapping four key milestones—subfertility, sterility, menstrual irregularity onset, and menopause—as sigmoidal functions of age. The 50% probability thresholds lie at approximately 31, 41, 46, and 51 years, respectively, illustrating a predictable, age−related progression of reproductive decline (5). (B) Empirical ovarian−aging curve derived from 31,923 first−cycle ART patients: observed proportions of diminished ovarian reserve (DOR, black dots) plotted against age with a two−parameter logistic fit. The close agreement confirms an S−shaped rise in DOR risk that underpins panel (A) schematic.
Using the fitted S−shaped curve, we can map any individual’s predicted probability of POR, also called predicted DOR in our study, onto the ovarian aging curve to derive their “endocrine age.” Here, ovarian endocrine age is an index that projects an individual’s ovarian reserve score onto the population ovarian aging trajectory, with the goal of reflecting the relative state of their ovarian function rather than their chronological age.
Specifically, by measuring AMH levels and noting the subject’s chronological age, we calculate a predicted probability of POR. We then locate that probability on the S−shaped ovarian aging curve constructed from large−scale population data; the age at which the curve reaches that probability is defined as the ovarian endocrine age. If a young woman’s ovarian reserve score corresponds to an endocrine age substantially higher than her chronological age, this may indicate accelerated ovarian decline and a potentially earlier drop in fertility. Conversely, if her endocrine age is close to or below her chronological age, it suggests her ovarian function is relatively well preserved. In this way, ovarian endocrine age reflects the true status of ovarian reserve and, compared with chronological age, more accurately reveals the degree and rate of ovarian aging.
Predicting the age of onset for future fertility milestones
The “fixed-interval hypothesis” assumes that an individual’s ovarian aging trajectory shares the same shape as the population’s average curve—meaning the time intervals between successive stages of ovarian decline are constant, differing only in their starting points. In other words, although individuals may begin with different ovarian reserve levels, the overall pattern of functional decline over time can be described by a single sigmoid (S-shaped) curve (5). Evidence supporting this hypothesis primarily comes from cross-sectional observations (2). This hypothesis provides the theoretical basis and methodological support for predicting future ovarian reserve changes from a cross-sectional data.
On this basis, we derived the time intervals required for an individual to reach specific ovarian reserve states (predicted probability of POR). First, we map the subject’s current ovarian reserve score onto the S-curve to determine her position within the overall aging process. Next, using the fitted curve, we calculate the interval needed to drop from the current score to a predefined POR probability (e.g., a specific POR probability, which correspond to a specific ovarian reserve score), then converted these intervals into ages at which she would reach each milestone. We implemented this functionality in a software tool (see Figure 3). The program uses AMH and chronological age to calculate the user’s current reserve score and “endocrine age,” then predicts the ages at which she will reach a score of 50 and enter perimenopause (the lowest reserve score observed in an ART population). We have also updated these algorithms into an online tool (http://121.43.113.123:8005/).

Figure 3. The OvaRePred software interface for ovarian reserve assessment and menopause prediction. The left panel captures patient demographics and AMH data, while the right panel presents outputs: ovarian reserve score with grade, endocrine ovarian age, estimated ages at 50−point reserve and perimenopause onset, and risk assessment.
AMH distributions across age groups in ART vs. community populations
In our study, serum AMH concentrations from the ART cohort—comprising women undergoing assisted reproductive technology treatments—served as the primary data source. The median AMH values for women aged ≤ 30 years, > 30–≤ 40 years, and > 40 years were 3.65 ng/mL, 2.26 ng/mL, and 0.98 ng/mL, respectively. These were compared with median AMH levels reported in previously published population-based epidemiological studies representing the general female population, which showed values of 3.89 ng/mL, 2.28 ng/mL, and 0.34 ng/mL for the corresponding age groups (13). For women under 40, the empirical AMH distributions in the ART and general population cohorts were nearly identical, whereas in women over 40, the community data demonstrated a lower, left-shifted distribution (Figure 4), distributions are nearly identical in younger groups; a divergence appears in women ≥40, reflecting differences in sample composition. Likely reflecting the inclusion of a more representative cross-section of women in the general population survey. Since women under 40 made up the majority of both cohorts and exhibited highly similar AMH profiles, these findings indicate that the AMH–age relationship derived from our ART cohort is largely generalizable to the wider population. Detailed percentiles of AMH concentrations by age group for both cohorts are provided in Supplementary Table 1.
![Three density plots showing the distribution of ln[AMH] levels in ART (blue) and general population (red) groups across three age strata: ≤30 years, 30–40 years, and ≥40 years. For each stratum, sample sizes are indicated.• In women ≤30 and 30–40, the two populations show largely overlapping distributions.• In women ≥40, the distributions diverge: the ART population shows a peak shifted toward higher ln[AMH].](https://www.frontiersin.org/files/Articles/1658068/fendo-16-1658068-HTML/image_m/fendo-16-1658068-g004.jpg)
Figure 4. Age−specific log−transformed AMH distributions in ART versus general populations. Log−transformed AMH distributions for assisted−reproduction (ART, blue) and community−based general (red) populations stratified by age: ≤30 years (ART = 10 365, general = 2 417), 30–≤40 years (ART = 19 237, general = 4 342) and ≥40 years (ART = 2 321, general = 3 356). Distributions are nearly identical in younger groups; a divergence appears in women ≥40, reflecting differences in sample composition.
Discussion
Advancing ovarian reserve assessment: from categorical to continuous modeling in OvaRePred(HerTempo) optimization
Ovarian reserve is the primary determinant of female fertility (14). With advancing age or under disease impact, ovarian reserve gradually becomes depleted and fertility correspondingly declines until exhaustion. Assessing ovarian reserve enables early detection of insufficiency and supports personalized fertility planning and health management for women. This study aims to update and optimize the algorithms underlying the previous OvaRePred (HerTempo) tool using AA model (AMH+Age) (8, 10). The original AA model employed categorical transformations of its predictors. Categorical‐variable models are simple and interpretable: by grouping data they capture overall trends, maintain stability, and exhibit strong robustness to outliers. However, grouping can be crude, failing to reflect subtle variations, and the choice of cutpoints is often subjective.
In contrast, continuous‐variable models retain all original information and—through appropriate transformations (e.g., cubic terms)—more accurately characterize nonlinear relationships, enhancing discrimination and predictive accuracy. However, they are more complex to build and interpret, sensitive to outliers, and prone to overfitting with limited sample sizes. For these reasons, and given our earlier sample size, we converted continuous variables into categorical ones. In this update, however, with a larger dataset, we experimented with multiple predictor transformations, and the final Model-2 achieved superior calibration compared to the original Model-0.
Comparison of parametric and non−parametric modeling approaches
During the modeling process, we explored various non-parametric methods (models without explicit functional forms), such as random forest, neural networks, and generalized additive models (GAMs). Although these models performed slightly better on the training set, their performance on the test set was similar to that of our traditional parametric models, with AUCs of 0.861, 0.863, and 0.863 for random forest, neural networks, and GAMs, respectively. This indicates that the non-parametric models achieved comparable results to the parametric approaches discussed in this manuscript. In comparison, we preferred parametric models with clear predictive formulas and high interpretability. These models have well-defined algorithmic principles that enable clinicians to intuitively understand the decision-making mechanisms, thereby improving trust and acceptance. Additionally, they are less demanding in terms of computational resources, making them easier to implement across different levels of healthcare institutions. Therefore, we ultimately selected a model that combines strong interpretability with practical clinical applicability.
Impact of estradiol-driven AMH variation on model performance
During controlled ovarian stimulation, estradiol (E2) levels rise steadily, peaking around the hCG trigger day. Our supplementary data show that AMH concentrations decline significantly during this period—about 17.4% by day 6 and nearly 49.7% by the hCG day compared to day 2 levels (As shown in supplementary result). This sharp AMH decrease at hCG day leads to a notable drop in model performance, with the predictive accuracy (AUC) falling from 0.868 to 0.652. However, when AMH variation is minimal or moderate earlier in ovarian stimulation (e.g., cycle day 6 VS cycle day 2), model discrimination remains largely stable. These results highlight the importance of measuring AMH at the early follicular phase, when levels are more stable and E2 is low, to ensure optimal accuracy of ovarian reserve assessments and reproductive milestone predictions with tools like OvaRePred (HerTempo).
Sources of AMH variability
When using the OvaRePred (HerTempo) tool, users should note that AMH is the primary and most heavily weighted predictive marker (8, 15), any element that causes AMH fluctuations may impact its prediction performance. Although AMH is generally considered stable during the menstrual cycle (16, 17), fluctuations in follicle status can lead to significant intra-cycle changes. AMH naturally declines with age (18), emphasizing the need for regular ovarian reserve assessments. Estrogenic drugs—such as those used in hormone replacement therapy or oral contraceptives—can temporarily lower AMH levels (19). Similarly, ovarian stimulation with FSH during assisted reproductive treatment (20), certain chemotherapy agents like cyclophosphamide (21), and acute ovarian conditions (e.g., cysts or inflammation) can affect AMH levels (22). Additional factors such as stress, systemic inflammation (23, 24), lifestyle changes like intense exercise or extreme diets (25), rapid weight fluctuations (26), testing time, blood collection techniques, sample storage conditions (27, 28), and variations between testing platforms or reagent batches (29) may all contribute to technical variability. Early pregnancy may cause a temporary decline in AMH (30), and short-term exposure to environmental endocrine disruptors (e.g., bisphenol A) can also affect AMH levels, either temporarily or permanently (31, 32).
Standardization and best practices for AMH measurement
We recognize that the reliability of AMH-based predictive models depends critically on standardized measurement protocols and meticulous clinical documentation. Although AMH is generally considered relatively stable, both biological and technical factors—including cycle variability, medications, ovarian pathology, environmental exposures, and assay differences—can influence its levels (16, 17, 19, 25, 26, 29, 31, 32).
To minimize variability and improve the accuracy of both clinical and research applications, we recommend that blood samples for AMH measurement be collected in the early follicular phase (typically cycle days 2–3), whenever feasible. At this time, estradiol levels are at their nadir, minimizing their suppressive effect on AMH secretion (33). Importantly, this is also the time window during which AMH measurements were obtained for the majority of participants in our model development and validation cohorts. For women with irregular menstrual cycles, identifying the early follicular phase may be more challenging. In such cases, clinicians should use careful cycle tracking and clinical judgment to approximate this window as closely as possible.
Repeat measurement of AMH may be considered in specific clinical scenarios, particularly when (i) the AMH value is unexpectedly low or high and inconsistent with other clinical findings (e.g., antral follicle count or reproductive history); (ii) the patient has recently undergone hormonal treatment (e.g., oral contraceptives, GnRH agonists); (iii) there is recent ovarian pathology (e.g., cysts, surgery, inflammation); or (iv) the measurement was taken during an ill-defined phase of the menstrual cycle or under unclear pre-analytical conditions. In such situations, a second AMH test—ideally performed in the early follicular phase and under stable physiological conditions—can improve result reliability and model prediction accuracy.
Longitudinal monitoring should be performed using the same validated assay platform. If switching platforms is unavoidable, calibration and conversion tools (such as our previously developed AMHConverter algorithm (34)) should be used to ensure comparability. Additionally, strict adherence to standardized protocols for sample collection, processing, transport, and storage is essential to minimize pre-analytical variability.
Finally, detailed clinical documentation—including the timing of the blood draw, menstrual cycle phase, concurrent medications, and any acute illness or environmental exposures—is crucial for interpreting AMH results accurately. By implementing these best practices, the robustness and clinical utility of AMH-based tools such as OvaRePred (HerTempo) can be further enhanced while maintaining consistency between model assumptions and real-world application.
Challenges for high−reserve women
Our OvaRePred (HerTempo) tool assumes that women have already passed the plateau phase—typically reached during puberty—when ovarian reserve peaks, before predicting future reproductive milestones. Consequently, many younger women with a high ovarian reserve may exhibit similar ovarian reserve scores for many years, which poses challenges in accurately predicting the timing of future reproductive milestones. For example, in women whose ovarian reserve remains high and stable, the model’s predicted age of perimenopause onset may substantially precede the actual age at which it will occur.
Platform flexibility and mitigation of batch variability
The updated version of the OvaRePred (HerTempo) tool now supports various AMH detection platforms, with conversion algorithms derived from our prior research (34). Moreover, we will continue to collaborate with reagent manufacturers to further mitigate the influence of batch variability on the results.
Generalizability to the general population
The AMH–age model was developed from single-center ART data, yet our AMH distribution analysis shows that women under 40 in the community cohort have AMH values comparable to those in the ART cohort. This observation supports the model’s general applicability for women ≤40 years and provides reassurance that ART-derived AMH–age relationships mirror those in the general population. However, differences observed in women >40 years indicate that future studies should recruit more older general participants to improve accuracy across the entire age spectrum.
Systemic effects of ovarian aging and holistic utility of OvaRePred
Ovarian aging is a pivotal component of the overall female aging process. Beyond its essential role in reproduction, the ovary functions as a crucial endocrine gland, regulating systemic homeostasis through the secretion of estrogen, progesterone, and androgens. As ovarian reserve diminishes and functional decline ensues, the consequent reduction in sex hormone production has been associated in prior literature with increased risk of cardiovascular diseases, osteoporosis, cognitive decline, metabolic disorders, immune dysregulation, and skin aging (35–43).
OvaRePred (HerTempo) was designed as an early−warning tool for ovarian aging. Specifically, for women with a low ovarian reserve score and an earlier predicted onset of perimenopause, OvaRePred may support earlier fertility planning or consideration of oocyte cryopreservation. Conversely, those with strong ovarian reserve and later predicted perimenopause may align reproductive timing with life goals. These fertility-focused applications remain the core validated purpose of the tool.
In addition to guiding fertility decisions, the OvaRePred tool can inform comprehensive health management, as illustrated in Figure 5. In this context, the significance of OvaRePred (HerTempo) extends far beyond predicting ovarian aging, offering a comprehensive tool for holistic women’s health management. However, the potential to expand OvaRePred’s utility for anticipatory health guidance remains an area for future research and should not be construed as a current clinical indication. While these associations suggest that ovarian health may play a broader role in women’s systemic well-being, we acknowledge that the current version of OvaRePred (HerTempo) has only been validated in the context of assisted reproduction. Therefore, these broader implications are only hypothetical future directions rather than validated applications.

Figure 5. Infographic illustrating OvaRePred’s concept of ovarian reserve scoring. A female silhouette highlights the ovary as the “core of female aging,” linking endocrine age to cardiovascular, bone, cognitive and metabolic screening priorities. The gauge depicts reserve score (0‑100); a timeline aligns diminished reserve, perimenopause and menopause for anticipatory, proactive health management.
Population−level applications and public health integration
OvaRePred (HerTempo)’s validated, user-friendly algorithm enables scalable ovarian reserve assessment at the population level. Integration into national or regional women’s health programs could facilitate early detection of diminished ovarian reserve, guide fertility counseling, and inform public health resource planning. Key barriers—such as test cost, laboratory access, and low public awareness—can be mitigated through tiered screening approaches, point-of-care diagnostics, health education initiatives, and strategic policy support. Harnessing aggregated OvaRePred data may also enable population health surveillance and support data-driven policy-making to improve women’s health outcomes on a broader scale.
Limitations
Despite significant progress achieved in this study, the OvaRePred (HerTempo) tool has several limitations that warrant further refinement. First, the current model is primarily developed based on single-center ART population data; thus, broader applicability requires additional samples and multi-center external validation. It is crucial to address variability across different ethnicities, regions, and cultural backgrounds to continuously optimize model parameters and improve both clinical utility and predictive accuracy. Second, the lack of documented blood collection timing (i.e., specific day of the menstrual cycle) for some reproductive-age women may affect result accuracy, underscoring the need for improved data collection protocols to ensure stability and reproducibility. Third, while temporal data splitting was used to simulate prospective validation, this approach may be subject to time-based confounding—such as changes in clinical protocols, shifts in patient population characteristics, or external factors like the COVID-19 pandemic—which could influence model performance. Future studies should consider combining temporal and random splitting strategies or incorporating external datasets to better mitigate such effects. Moreover, expanding the range of input parameters—such as ovarian aging variations under diverse disease conditions, and influences from medications, lifestyle factors, and environmental exposures—could enhance the tool’s adaptability and precision in complex clinical settings, supporting more personalized health management.
The fixed-interval hypothesis, which posits a consistent temporal relationship between reproductive aging events, underpins part of our predictive framework. Although it is supported by cross-sectional and historical population data, this assumption remains unvalidated at the individual level. Longitudinal confirmation would require tracking natural menstrual and fertility patterns over many years; however, in contemporary settings, such prospective designs are extremely difficult to implement. The widespread use of hormonal contraception, together with other medical or lifestyle interventions, obscures natural reproductive trajectories and greatly limits the feasibility of obtaining uninterrupted, long-term observations. Consequently, reliance on this unverified assumption constitutes a significant limitation, particularly for making individualized long-term predictions. Future research should prioritize the rare opportunity to conduct long-term, prospectively designed cohort studies in populations minimally influenced by hormonal contraception or other factors that alter natural reproductive aging.
Importantly, OvaRePred (HerTempo) focuses solely on quantifying ovarian reserve, reflected by follicle quantity, and does not directly assess follicle or oocyte quality, nor overall fertility potential. This distinction is critical in clinical scenarios such as PCOS, where patients may exhibit high follicle counts but impaired oocyte developmental competence. In such cases, a high reserve score does not necessarily equate to optimal reproductive capacity. Therefore, for a more comprehensive fertility evaluation, OvaRePred results should be interpreted alongside complementary diagnostic markers—such as oocyte maturity rates, detailed hormonal profiles, and, where available, indicators of oocyte or embryo competence. Integrating these multidimensional assessments enables a more nuanced understanding of reproductive potential and guides individualized clinical decision-making.
Conclusion
In summary, OvaRePred (HerTempo), a comprehensive evaluation tool powered by big data and artificial intelligence, not only provides precise decision support for fertility planning but also paves a new path for holistic women’s health management. Through continual optimization and interdisciplinary integration, OvaRePred (HerTempo) is poised to become a vital bridge between reproductive medicine and overall health management, offering more precise and personalized health guidance for women.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.
Ethics statement
The studies involving humans were approved by The institutional review board of Peking University Third Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants’ legal guardians/next of kin because the dataset used in this study contains deidentified data.
Author contributions
HX: Writing – review & editing, Methodology, Investigation, Writing – original draft. GF: Formal Analysis, Writing – review & editing, Methodology, Writing – original draft. RY: Data curation, Writing – original draft, Formal Analysis. YH: Writing – original draft, Formal Analysis. HC: Investigation, Writing – original draft. RL: Funding acquisition, Supervision, Conceptualization, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research and/or publication of this article. This study was supported by National Key Research and Development Program of China (grant number 2024YFC2706900 and 2023YFC2705500); and the Innovation & Transfer Fund of Peking University Third Hospital (Grant No. BYSYZHKC2023102, and BYSYZHZB2020102).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The handling editor DL declared a past co-authorship with the author RL.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2025.1658068/full#supplementary-material
References
1. Wallace WH and Kelsey TW. Human ovarian reserve from conception to the menopause. PloS One. (2010) 5:e8772. doi: 10.1371/journal.pone.0008772
2. Broekmans FJ, Soules MR, and Fauser BC. Ovarian aging: mechanisms and clinical consequences. Endocr Rev. (2009) 30:465–93. doi: 10.1210/er.2009-0006
3. Podfigurna A, Lukaszuk K, Czyzyk A, Kunicki M, Maciejewska-Jeske M, Jakiel G, et al. Testing ovarian reserve in pre-menopausal women: why, whom and how? Maturitas. (2018) 109:112–7. doi: 10.1016/j.maturitas.2017.11.014
4. Tal R and Seifer DB. Ovarian reserve testing: a user’s guide. Am J Obstetrics Gynecology. (2017) 217:129–40. doi: 10.1016/j.ajog.2017.02.027
5. te Velde ER and Pearson PL. The variability of female reproductive ageing. Hum Reprod Update. (2002) 8:141–54. doi: 10.1093/humupd/8.2.141
6. Ferraretti AP, La Marca A, Fauser BC, Tarlatzis B, Nargund G, and Gianaroli L. ESHRE consensus on the definition of ‘poor response’ to ovarian stimulation for in vitro fertilization: the Bologna criteria. Hum Reprod. (2011) 26:1616–24. doi: 10.1093/humrep/der092
7. Alviggi C, Andersen CY, Buehler K, Conforti A, De Placido G, Esteves SC, et al. A new more detailed stratification of low responders to ovarian stimulation: from a poor ovarian response to a low prognosis concept. Fertil Steril. (2016) 105:1452–3. doi: 10.1016/j.fertnstert.2016.02.005
8. Xu HY, Feng GS, Yang R, Li R, and Qiao J. OvaRePred: Online tool for predicting the age of fertility milestones. Innovation. (2023) 4:100490. doi: 10.1016/j.xinn.2023.100490
9. Xu H, Wang W, Han Y, Li R, and Qiao J. OvaRePred: Redefing ovarian aging and pioneering the journey towards women’s health management. Innovation Med. (2024) 2:100068. doi: 10.59717/j.xinn-med.2024.100068
10. Han Y, Xu H, Feng G, Wang H, Alpadi K, Chen L, et al. An online tool for predicting ovarian reserve based on AMH level and age: A retrospective cohort study. Front Endocrinol (Lausanne). (2022) 13:946123. doi: 10.3389/fendo.2022.946123
11. Schielzeth H. Simple means to improve the interpretability of regression coefficients. Methods Ecol Evol. (2010) 1:103–13. doi: 10.1111/j.2041-210X.2010.00012.x
12. Feng G, Xu H, Wan S, Wang H, Chen X, Magari R, et al. Twelve practical recommendations for developing and applying clinical predictive models. Innovation Med. (2024) 2:100105. doi: 10.59717/j.xinn-med.2024.100105
13. Hao YX, Yang R, Li J, Zhou ZH, Qian WP, Zhang J, et al. Age-specific random day serum antimullerian hormone reference values for women of reproductive age in the general population: a large Chinese nationwide population-based survey. Am J Obstetrics Gynecology. (2022) 227:883. doi: 10.1016/j.ajog.2022.07.029
14. Xu H, Feng G, Yang R, Li R, and Qiao J. Reaffirming the role of ovarian reserve in fertility assessment: Insights from OvaRePred. Innovation Med. (2025) 3:100135. doi: 10.59717/j.xinn-med.2025.100135
15. Xu H, Shi L, Feng G, Xiao Z, Chen L, Li R, et al. An ovarian reserve assessment model based on anti-mullerian hormone levels, follicle-stimulating hormone levels, and age: retrospective cohort study. J Med Internet Res. (2020) 22:e19096. doi: 10.2196/19096
16. La Marca A, Grisendi V, and Griesinger G. How much does AMH really vary in normal women? Int J Endocrinol. (2013) 2013:959487. doi: 10.1155/2013/959487
17. Khodavirdilou R, Pournaghi M, Rezaei YR, Hajizadeh K, Khodavirdilou L, Javid F, et al. Does Anti-Mullerian hormone vary during a menstrual cycle? A systematic review and meta-analysis. J Ovarian Res. (2022) 15:78. doi: 10.1186/s13048-022-01006-z
18. Xu H, Zhang M, Zhang H, Alpadi K, Wang L, Li R, et al. Clinical applications of serum anti-mullerian hormone measurements in both males and females: an update. Innovation. (2021) 2:100091. doi: 10.1016/j.xinn.2021.100091
19. Yin WW, Huang CC, Chen YR, Yu DQ, Jin M, and Feng C. The effect of medication on serum anti-mullerian hormone (AMH) levels in women of reproductive age: a meta-analysis. BMC Endocr Disord. (2022) 22:158. doi: 10.1186/s12902-022-01065-9
20. La Marca A, Malmusi S, Giulini S, Tamaro LF, Orvieto R, Levratti P, et al. Anti-Mullerian hormone plasma levels in spontaneous menstrual cycle and during treatment with FSH to induce ovulation. Hum Reprod. (2004) 19:2738–41. doi: 10.1093/humrep/deh508
21. Ejaz K, Abid D, Juneau P, Chu J, and Hasni S. Use of gonadotropin-releasing hormone agonists for ovarian preservation in patients receiving cyclophosphamide for systemic lupus erythematosus: A meta-analysis. Lupus. (2022) 31:1706–13. doi: 10.1177/09612033221128740
22. Shi J, An D, Ye J, Fu RJ, and Zhao AM. Effect of early inflammatory reaction on ovarian reserve after laparoscopic cystectomy for ovarian endometriomas. J Obstetrics Gynaecology. (2022) 42:3124–8. doi: 10.1080/01443615.2022.2106559
23. Sun SL, Chen H, Zheng XX, Ma CY, and Yue RQ. Analysis on the level of IL-6, IL-21, AMH in patients with auto-immunity premature ovarian failure and study of correlation. Exp Ther Med. (2018) 16:3395–8. doi: 10.3892/etm.2018.6592
24. Henes M, Froeschlin J, Taran FA, Brucker S, Rall KK, Xenitidis T, et al. Ovarian reserve alterations in premenopausal women with chronic inflammatory rheumatic diseases: impact of rheumatoid arthritis, Behcet’s disease and spondyloarthritis on anti-Mullerian hormone levels. Rheumatology. (2015) 54:1709–12. doi: 10.1093/rheumatology/kev124
25. Vosnakis C, Georgopoulos NA, Rousso D, Mavromatidis G, Katsikis I, Roupas ND, et al. Diet, physical exercise and Orlistat administration increase serum Anti-Mullerian Hormone (AMH) levels in women with polycystic ovary syndrome (PCOS). Gynecol Endocrinol. (2013) 29:242–5. doi: 10.3109/09513590.2012.736557
26. Kloos J, Coyne K, and Weinerman R. The relationship between anti-Mullerian hormone, body mass index and weight loss: A review of the literature. Clin Obes. (2022) 12:e12559. doi: 10.1111/cob.12559
27. Resch J, Wolfer K, Barth A, and Kalberer M. Effects of storage conditions on the molecular-level composition of organic aerosol particles. Atmos Chem Phys. (2023) 23:9161–71. doi: 10.5194/acp-23-9161-2023
28. Gupta S, Jamwal V, Subhash B, and Mohanlal K. Study of antimullerian hormone (amh) level variation in females in assisted reproductive technology (art) centre. Archives. (2020) 4:11–4. doi: 10.32553/IJMBS.V4I9.1382
29. Magnusson A, Olerod G, Thurin-Kjellberg A, and Bergh C. The correlation between AMH assays differs depending on actual AMH levels. Hum Reprod Open. (2017) 2017:hox026. doi: 10.1093/hropen/hox026
30. Pankhurst MW, Clark CA, Zarek J, Laskin CA, and McLennan IS. Changes in circulating proAMH and total AMH during healthy pregnancy and post-partum: A longitudinal study. PloS One. (2016) 11:e0162509. doi: 10.1371/journal.pone.0162509
31. Priya K, Setty M, Babu UV, and Pai KSR. Implications of environmental toxicants on ovarian follicles: how it can adversely affect the female fertility? Environ Sci pollut R. (2021) 28:67925–39. doi: 10.1007/s11356-021-16489-4
32. Zhang NX, Zhao YN, Zhai LL, Bai YL, Jia LH, and Faggio C. Urinary bisphenol A and S are associated with diminished ovarian reserve in women from an infertility clinic in Northern China. Ecotox Environ Safe. (2023) 256:114867. doi: 10.1016/j.ecoenv.2023.114867
33. Grynberg M, Pierre A, Rey R, Leclerc A, Arouche N, Hesters L, et al. Differential Regulation of Ovarian Anti-Mullerian Hormone (AMH) by Estradiol through alpha- and beta-Estrogen Receptors. J Clin Endocr Metab. (2012) 97:E1649–E57. doi: 10.1210/jc.2011-3133
34. Xu H, Feng G, Ma C, Han Y, Zhou J, Song J, et al. AMHconverter: an online tool for converting results between the different anti-Mullerian hormone assays of Roche Elecsys((R)), Beckman Access, and Kangrun. PeerJ. (2023) 11:e15301. doi: 10.7717/peerj.15301
35. Traub ML and Santoro N. Reproductive aging and its consequences for general health. Reprod Aging. (2010) 1204:179–87. doi: 10.1111/j.1749-6632.2010.05521.x
36. den Ruijter HM and Kararigas G. Estrogen and cardiovascular health. Front Cardiovasc Med. (2022) 9:886592. doi: 10.3389/fcvm.2022.886592
37. Vaananen HK and Harkonen PL. Estrogen and bone metabolism. Maturitas. (1996) 23:S65–S9. doi: 10.1016/S0378-5122(96)90018-3
38. Yamamoto N, Wada-Hiraike O, Hirano M, Hirata T, Harada M, Hirota Y, et al. Ovarian reserve may influence the outcome of bone mineral density in patients with long-term use of dienogest. SAGE Open Med. (2021) 9. doi: 10.1177/20503121211005992
39. Meczekalski B, Niwczyk O, Bala G, and Szeliga A. Managing early onset osteoporosis: the impact of premature ovarian insufficiency on bone health. J Clin Med. (2023) 12:4042. doi: 10.3390/jcm12124042
40. Bustamante-Barrientos FA, Méndez-Ruette M, Ortloff A, Luz-Crawford P, Rivera FJ, Figueroa CD, et al. The impact of estrogen and estrogen-like molecules in neurogenesis and neurodegeneration: beneficial or harmful? Front Cell Neurosci. (2021) 15:636176. doi: 10.3389/fncel.2021.636176
41. Russell JK, Jones CK, and Newhouse PA. The role of estrogen in brain and cognitive aging. Neurotherapeutics. (2019) 16:649–65. doi: 10.1007/s13311-019-00766-9
42. Lizcano F and Guzmán G. Estrogen deficiency and the origin of obesity during menopause. BioMed Res Int. (2014) 2014:757461. doi: 10.1155/2014/757461
Keywords: ovarian reserve, anti-müllerian hormone (AMH), ovarian aging curve, endocrine age, reproductive milestone prediction, personalized fertility planning, health management
Citation: Xu H, Feng G, Yang R, Han Y, Chi H and Li R (2025) OvaRePred (HerTempo): an enhanced ovarian aging clock for personalized reserve assessment, endocrine age modeling, and predicting reproductive milestones across the female lifecycle. Front. Endocrinol. 16:1658068. doi: 10.3389/fendo.2025.1658068
Received: 02 July 2025; Accepted: 21 August 2025;
Published: 12 September 2025.
Edited by:
Da Li, China Medical University, ChinaReviewed by:
Zhijing Na, China Medical University, ChinaArash Ziaee, Mashhad University of Medical Sciences, Iran
Copyright © 2025 Xu, Feng, Yang, Han, Chi and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Rong Li, cm9zZWxpMDAxQGJqbXUuZWR1LmNu
†These authors have contributed equally to this work