- 1Department of Nuclear Medicine, The Affiliated Hospital of Qingdao University, Qingdao, Shandong, China
- 2Department of Radiology, Qingdao Women and Children’s Hospital, Qingdao, Shandong, China
- 3Department of Medical Oncology, The Affiliated Hospital of Qingdao University, Qingdao, Shandong, China
Background: Radioiodine (RAI) therapy, while established for Graves’ hyperthyroidism (GH), exhibits variable efficacy (50-80% cure rates), with non-complete remission (NCR) necessitating retreatment. In the study, we aimed to identify independent predictors of NCR and develop a validated nomogram for personalized RAI outcome prediction.
Methods: Data from 285 GH patients undergoing initial RAI therapy were retrospectively analyzed and randomly allocated into training (n=199) and validation (n=86) cohorts at a 7:3 ratio. Univariate followed by multivariate logistic regression identified independent predictors of NCR in the training cohort. These variables informed the construction of a prognostic nomogram model, subsequently verified in the validation cohort through calibration, receiver operating characteristic (ROC) curve, and decision curve analysis (DCA) to assess model reliability, discriminative ability, and clinical utility.
Results: Thyroid mass (TM), 24-hour RAI uptake (RAIU24h), effective half-life (Teff), and free triiodothyronine reduction at 1-month post-therapy (ΔFT3) were independent predictors. The prognostic nomogram integrating these variables exhibited superior discriminative performance in both training (AUC = 0.919) and validation cohorts (AUC = 0.901). Calibration curves confirmed high fidelity between predicted and observed NCR probabilities. DCA demonstrated significant clinical net benefit across threshold probabilities.
Conclusion: TM, RAIU24h, Teff, and ΔFT3 are critical determinants of RAI efficacy in GH. The validated nomogram enables precise NCR risk stratification, facilitating optimized activity prescription to improve remission rates.
1 Introduction
Graves’ hyperthyroidism (GH), an organ-specific autoimmune disorder, represents the most prevalent cause of thyrotoxicosis, characterized by excessive thyroid hormone production due to thyroid-stimulating hormone (TSH) receptor antibodies (TRAb) (1–3). The clinical triad of hypermetabolism, diffuse goiter, and orbitopathy stems from systemic effects of thyroid hormone excess, which also disrupts bone metabolism, cardiovascular function, and other physiological processes. Left untreated, GH carries significant morbidity risks including atrial fibrillation, osteoporosis, and thyrotoxic crisis (4–6).
The current first-line therapeutic approaches for GH include antithyroid drugs (ATDs), radioiodine (RAI) therapy, and thyroidectomy. It should be noted that these treatments primarily aim to reduce circulating thyroid hormone levels rather than directly addressing the underlying etiology, such as modulating TRAb or correcting thyroid autoimmune dysregulation (4, 5).
RAI therapy has been established as a treatment modality for hyperthyroidism for over eight decades. Recent epidemiological studies indicate that 59% of North American clinicians prefer RAI as the initial treatment for GH (7). Compared with alternative therapies, RAI offers distinct advantages including superior efficacy, fewer adverse effects, and more rapid onset of therapeutic action. Current clinical guidelines strongly recommend RAI for patients who exhibit poor response to ATDs, experience disease recurrence after ATDs withdrawal, develop ATDs-related adverse reactions (e.g., leukopenia or hepatic dysfunction), or present with complications such as atrial fibrillation (4, 5). Despite the well-established efficacy of RAI, treatment outcomes exhibit considerable variability, with single-dose cure rates ranging from 50-80% across studies, and hypothyroidism developing in 20-30% of patients within the first year (8–10).
Existing predictive models for RAI therapy outcomes in GH have largely relied on static pre-therapeutic parameters, such as thyroid mass (TM), 24-hour radioiodine uptake (RAIU24h), and effective half-life (Teff) (9, 11, 12). A significant limitation of these models is their inability to account for early biological response, which reflects the initial radiation-induced thyroid follicular cell damage. The dynamics of thyroid hormone levels, particularly the reduction in free triiodothyronine (FT3)/free thyroxine (FT4) at one month post-RAI, offer a pathophysiologically grounded biomarker that may serve as an early indicator of treatment efficacy. However, the prognostic value of this early hormonal response for predicting non-remission remains inadequately characterized. Therefore, this study was designed to elucidate the predictive power of early thyroid function dynamics and to identify determinants of treatment non-remission, with the ultimate goal of informing personalized therapeutic strategies.
2 Materials and methods
2.1 Study conduct and patients
This retrospective study was conducted at the Affiliated Hospital of Qingdao University. The protocol was approved by the institution’s Ethics Review Board with a waiver of informed consent (Ethics approval number: QYFY WZLL 30462). From January 2021 to December 2023, all GH patients who first underwent RAI therapy in the Affiliated Hospital of Qingdao University were reviewed.
GH was diagnosed according to established criteria comprising: (i) biochemical confirmation of thyrotoxicosis (suppressed TSH with elevated free thyroid hormones), (ii) diffuse thyroid enlargement on palpation, and (iii) either TRAb positivity or increased radioactive iodine uptake (RAIU). Characteristic extrathyroidal manifestations (orbitopathy or pretibial myxedema) provided supporting evidence when present.
The inclusion criteria were as follows: (i) confirmed GH diagnosis; (ii) discontinuation of ATDs for at least one week preceding RAI, and (iii) no ATDs administration following RAI. The exclusion criteria comprised: (i) previous or fractionated RAI, (ii) lost to follow-up or incomplete clinical data, and (iii) concurrent suspicious malignant thyroid nodules.
2.2 Data collection and RAI procedures
Prior to RAI administration, all patients were instructed to maintain a low-iodine diet and discontinue iodine-containing medications for 7–14 days. ATDs were discontinued as recommended by the 2016 ATA hyperthyroidism guidelines (methimazole for 3–5 days or propylthiouracil for 1–2 weeks) prior to RAI administration to enhance thyroidal RAI uptake (5). Pregnancy and lactation women constituted absolute contraindications for treatment (5, 13).
Laboratory assessments were performed to evaluate thyroid function, including TSH (reference range: 0.75-5.6 μIU/mL), free triiodothyronine (FT3; reference range: 3.1-6.8 pmol/L), free thyroxine (FT4; reference range: 12.8-21.3 pmol/L), thyroglobulin antibody (TgAb; reference range: 0–115 IU/mL), thyroid peroxidase antibody (TPOAb; reference range: 0–34 IU/mL), and TRAb (reference range: 0-1.75 IU/mL). Serum levels of FT3, FT4, TSH, TgAb, TPOAb, and TRAb were quantified using a Cobas e 801 automated electrochemiluminescence immunoassay analyzer (Roche Diagnostics GmbH, Mannheim, Germany) with the corresponding reagent kits (Elecsys FT3 III, Elecsys FT4 III, Elecsys TSH, Elecsys Anti-Tg, Elecsys Anti-TPO, and Elecsys Anti-TSHR), in strict accordance with the manufacturer’s protocols. All patients routinely underwent thyroid ultrasonography to assess for the presence of nodules, as well as 99mTc-pertechnetate thyroid scan to exclude subacute thyroiditis and measure thyroid volume. Thyroid volume (TV) was calculated using the following formula: TV (cm³)= mean thyroid lobe height (cm)×sum of bilateral frontal projected areas (cm²)×K, where K is a correction factor ranging between 0.23 and 0.32, depending on imaging conditions (14). Subsequently, TM was estimated from the volume by assuming a tissue density of 1.0 g/cm³ [TM (g)=1×TV (cm³)] (13, 15). The thyroid RAIU was dynamically assessed at 2, 4, and 24 hours post-administration to determine the RAIU24h and Teff.
The therapeutic 131I activity was calculated based on TM and RAIU24h using the following formula: Oral 131I activity (μCi) = [prescribed dose (μCi/g) × TM (g)]/RAIU24h (%). Three experienced nuclear medicine physicians determined the prescribed iodine dose per gram of thyroid tissue for each patient according to their clinical condition, typically ranging between 70-150 μCi/g.
We conducted a comprehensive review of clinical parameters obtained from electronic medical records, including: patient demographics (age, gender, disease course, TM, body mass index [BMI]); thyroid function tests (FT3, FT4, TSH); immunological markers (TRAb, TgAb, TPOAb); nuclear medicine parameters (RAIU24h, Teff); and treatment-related data (total RAI dose). Teff was calculated from the serial RAI uptake measurements. It is defined as the time required for the RAI activity within the thyroid gland to decay to 50% of its initial value, resulting from the combined effects of physical decay and biological clearance (5, 9).
For analytical purposes, continuous variables of thyroid autoantibodies were categorized based on the assay’s reference ranges and established clinical cut-off points. Specifically, TRAb levels were classified as follows: <1.75 IU/mL (normal), 1.75–40 IU/mL (mildly elevated), and >40 IU/mL (significantly elevated). TgAb levels were categorized as <115 IU/mL (normal), 115–4000 IU/mL (moderately elevated), and >4000 IU/mL (highly elevated). TPOAb levels were categorized as <34 IU/mL (normal), 34–600 IU/mL (moderately elevated), and >600 IU/mL (highly elevated).
2.3 Magnitude of thyroid function changes at one month post-RAI
FT3 and FT4 levels were measured at two time points: 1–2 days before RAI and 1 month after RAI, designated as pre-RAI FT3, post-RAI FT3, pre-RAI FT4, and post-RAI FT4, respectively. The percent changes in thyroid hormone levels (ΔFT3 and ΔFT4) at 1 month post-RAI were calculated using the following equations: ΔFT3 = [(pre-RAI FT3 - post-RAI FT3)/pre-RAI FT3] × 100%; ΔFT4 = [(pre-RAI FT4 - post-RAI FT4)/pre-RAI FT4] × 100%.
2.4 Evaluation for the therapeutic efficacy of RAI
Following RAI therapy, all enrolled patients underwent a comprehensive clinical and biochemical evaluation during follow-up visits conducted at 6 to 12 months after treatment. Therapeutic outcomes were rigorously classified based on predefined response criteria (4, 5):
1. Euthyroidism: Resolution of GH symptoms with normalization of FT3, FT4, and TSH levels;
2. Hypothyroidism: Presence of hypothyroid symptoms (which could be absent in subclinical cases) accompanied by subnormal FT3 or FT4 and elevated TSH concentrations.
3. Partial Remission: Improvement in GH-related symptoms and signs with partial reduction, but not normalization of thyroid hormone levels.
4. Non-remission: Persistent or worsening hyperthyroid manifestations without significant decrease in FT3/FT4 concentrations.
For analytical purposes, patients achieving either euthyroidism or hypothyroidism were classified as the “complete remission” (CR) group, while those with persistent hyperthyroidism (including cases of partial remission and treatment failure) were categorized as the “non-complete remission” (NCR) group (9, 11, 12).
2.5 Study design and statistical analysis
GH patients were randomly allocated into training and cohorts at a 7:3 ratio. Statistical analysis was performed to evaluate the differences in ΔFT3 and ΔFT4 values between treatment response groups within the training cohort using the Mann-Whitney U test. Additionally, receiver operating characteristic (ROC) curve analysis was conducted to assess the predictive performance of these parameters for NCR. Univariate logistic regression analysis was initially performed in the training cohort to identify potential predictors of RAI response, followed by multivariate analysis to determine independent risk factors for NCR. These significant predictors were subsequently incorporated into a prognostic nomogram constructed to estimate the probability of NCR following RAI therapy. The predictive accuracy and discriminative ability of the nomogram were then rigorously evaluated in the validation cohort.
The nomogram’s discriminative ability was evaluated using Harrell’s concordance index (C-index), which measures the agreement between predicted probabilities and actual outcomes. ROC curve analysis further assessed predictive performance, with the area under the curve (AUC) serving as a quantitative measure of accuracy. Calibration curves graphically compared predicted probabilities against observed event rates, evaluating model precision. Additionally, decision curve analysis (DCA) determined clinical utility by quantifying net benefit across various probability thresholds. All statistical analyses—including nomogram construction (rms[6.4.0], ResourceSelection[0.3-5]), C-index calculation (rms[6.4.0], ResourceSelection[0.3-5]), ROC analysis (pROC[1.18.0], ggplot2[3.4.4]), calibration curve assessment (rms[6.4.0], ResourceSelection[0.3-5]), and DCA evaluation (rmda[1.6], ggplot2[3.4.4])—were performed using R software (version 4.2.1). A two-tailed P-value < 0.05 defined statistical significance.
3 Results
3.1 Baseline clinical characteristics
Between January 2021 and December 2023, our institution enrolled 328 GH patients for RAI therapy. Following the study protocol, 43 subjects were excluded due to loss to follow-up or incomplete clinical data, yielding an 86.9% (285/328) eligibility rate for final analysis. The cohort was subsequently randomized into training (n=199) and validation (n=86) subsets at a 7:3 ratio for predictive model development and therapeutic outcome evaluation, as detailed in Figure 1.
Figure 1. Research flowchart. GH, Graves’ hyperthyroidism; RAI, radioiodine; CR, complete remission; NCR, non-complete remission; Teff, thyroid effective half-life; TM, thyroid mass; RAIU24h, 24-hour radioactive iodine uptake; ROC, receiver operating characteristic; and DCA, decision curve analysis.
Baseline clinical characteristics of the GH patients were well-balanced between the training and validation cohorts, as summarized in Table 1 (all p>0.05).
3.2 Clinical outcomes of GH patients after initial RAI therapy in the training and validation cohorts
Figure 2 demonstrated the clinical outcomes following initial RAI administration in the training and validation cohorts. NCR rates were similar between two cohorts (41.7% [83/199] vs 40.7% [35/86], p=0.874).
Figure 2. Clinical outcomes following RAI therapy in the training and validation cohorts. (A) Four-category therapeutic response (euthyroidism, hypothyroidism, partial remission, and non-remission). (B) Binary therapeutic response classification (CR vs. NCR). RAI, radioiodine; CR, complete remission; NCR, non-complete remission.
3.3 Comparison of the ΔFT3 and ΔFT4 in different therapeutic outcomes in the training cohort
The median ΔFT3 in CR group was 71.8%, while that in the NCR group was only 41.1%. There was a statistical difference between the two groups (U = 1429.0, P < 0.001; Figure 3A). Meanwhile, the median ΔFT4 in CR group was 56.3%, which was also significantly higher than the 32.3% in the NCR group (U = 1667.0, P < 0.001; Figure 3C).
Figure 3. Comparison of the ΔFT3 (A) and ΔFT4 (C) level for CR and NCR in the training cohort. ROC curves of ΔFT3 (B) and ΔFT4 (D) for detecting NCR in the training cohort. CR, complete remission; NCR, non-complete remission; ROC, receiver operating characteristic; AUC, area under the ROC curve; *** P<0.001.
The cut-off value of ΔFT3 at 62.9% and ΔFT4 at 50.5% were obtained by ROC curve analyses to best distinguish CR and NCR, with corresponding specificities of ΔFT3 and ΔFT4 separately of 72.4% and 67.2%, and sensitivities of 84.3% and 88.0%, and AUCs of 0.852 and 0.827, respectively (Figures 3B, D). Thus, if the ΔFT3 was lower than 62.9% and/or ΔFT4 was lower than 50.5%, the curative effect was more likely to be NCR.
3.4 Risk factors for NCR of GH patients after initial RAI therapy in the training cohort
Upon analyzing the relation between clinical characteristics and NCR in GH patients after initial RAI therapy, 13 factors were included in the univariate analyses (Table 2). The results indicated that patients with disease course>2 years (P = 0.007), higher TRAb level (P = 0.002), lower TPOAb (P = 0.039), shorter Teff (P<0.001), lower RAIU24h (P<0.001), higher total dose (P<0.001), higher TM (P<0.001), ΔFT3 <62.9% (P<0.001) or ΔFT4 <50.5% (P<0.001) had a high likelihood of developing NCR. By contrast, no significant differences were found in gender (P = 0.636), age (P = 0.587), BMI (P = 0.051), and TgAb levels (P = 0.135).
Multivariate analysis revealed that Teff (odds ratio [OR]: 0.438, 95% confidence interval [CI]: 0.288-0.667, P < 0.001), TM (OR: 1.031, 95% CI: 1.016-1.046, P < 0.001), RAIU24h (OR: 0.934, 95% CI: 0.899-0.969, P < 0.001), and ΔFT3(<62.9%/≥62.9%) (OR: 0.092, 95% CI: 0.037-0.225, P < 0.001) were significantly associated with the risk of NCR (Table 3).
Table 3. Multivariate analysis for NCR in GH patients using logistic regression in the training cohort.
3.5 Development of a prognostic nomogram for NCR rates in GH patients after initial RAI in the training cohort
Based on the independent risk factors identified by multivariate logistic regression, we constructed a Nomogram to predict the probability of NCR in GH patients after initial RAI therapy in the training cohort. The predictive model incorporates weighted scores for each variable, with TM demonstrating the strongest association with NCR risk, followed by RAIU24h, Teff, and ΔFT3 (Figure 4). By summing the individual scores and projecting the total points onto the risk axis, clinicians can estimate the probability of NCR.
Figure 4. Clinical nomogram for predicting NCR risk following RAI therapy in the training cohort. Implementation protocol: To use it, one first locates a patient’s specific values for Teff, TM, and RAIU24h on their respective axes and draws a vertical line upward to the ‘Points’ axis to assign a partial score for each variable; for the categorical variable ΔFT3, one selects the corresponding point value based on whether the value is above or below the 62.9% threshold. The points for all four variables are then summed, and this total is located on the ‘Total Points’ axis; finally, by projecting a line downward from the total points to the bottom ‘Predicted Probability of NCR’ axis, the clinician can read the patient’s individualized risk of NCR, thereby facilitating personalized therapeutic decisions. NCR, non-complete remission; RAI, radioiodine; Teff, thyroid effective half-life; TM, thyroid mass; RAIU24h, 24-hour radioactive iodine uptake.
3.6 Evaluation of the nomogram model
The nomogram demonstrated strong predictive performance for NCR in GH patients following initial RAI therapy, with a C-index of 0.919 (95% CI 0.881-0.957). Calibration curves showed excellent agreement between predicted and observed outcomes (Figure 5A). When comparing predictive accuracy, the nomogram (AUC = 0.919) outperformed individual parameters including ΔFT3 (AUC = 0.784), TM (AUC = 0.768), Teff (AUC = 0.753), and RAIU24h (AUC = 0.658) (Figure 6A). Additionally, Decision curve analysis confirmed that the nomogram model had good clinical utility in predicting NCR (Figure 7A), with validation cohort results mirroring these findings (Figures 5B, 6B, 7B).
Figure 5. Performance evaluation of the nomogram for predicting non-complete remission (NCR) rates in training and validation cohorts. Calibration plots reveal strong agreement between predicted and observed NCR rates, with curves closely following the ideal reference line (45°diagonal) for both training (A) and validation (B) cohorts.
Figure 6. Performance evaluation of the nomogram for predicting NCR rates in training and validation cohorts. ROC curves demonstrate discrimination accuracy for NCR probability in both training (A) and validation (B) cohorts (AUC 0.919 and 0.894, respectively). NCR, non-complete remission; AUC, area under the ROC curve.
Figure 7. Performance evaluation of the nomogram for predicting NCR rates in training and validation cohorts. DCA confirms clinical utility, showing significant net benefit across for therapeutic decision-making in both training (A) and validation (B) cohorts. NCR, non-complete remission; DCA, decision curve analysis.
4 Discussion
RAI therapy remains a fundamental treatment modality for GH patients, with well-documented efficacy spanning over eight decades of clinical use (5, 7). Contemporary practice patterns reveal a preference among clinicians for administering higher RAI doses (typically 150-200 μCi/g thyroid tissue) to achieve rapid induction of hypothyroidism and reduce the need for retreatment. However, clinical outcomes demonstrate significant variability, with treatment failure rates persisting at 8-15% despite optimized dosimetric approaches, often requiring secondary interventions within 3–6 months of initial RAI therapy (4, 5, 9, 16). While previous investigations have primarily focused on absorbed radiation dose calculations, our study addresses a critical knowledge gap by systematically evaluating four understudied prognostic variables: RAIU24h, Teff, TM, and ΔFT3. Through multivariate analysis of these parameters, we developed and validated a clinically practical nomogram that predicts the probability of NCR following initial RAI therapy, thereby facilitating more personalized therapeutic decision-making.
Our study robustly positions Teff as a significant and independent predictor of treatment failure following RAI therapy for GH disease. The Teff, defined as the time for intra-thyroidal RAI activity to decay to 50% of its initial value, integrates the combined effects of physical decay and biological clearance. It quantifies the composite kinetics of iodine trapping, organification, and retention within thyrocytes (5, 9). Patients exhibiting slower iodine clearance (long Teff) inherently sustain a higher cumulative radiation exposure for any given administered activity per gram compared to those with rapid turnover (short Teff) (17). Consequently, Teff directly determines the cumulative beta-emission absorbed dose; this absorbed dose governs the extent of thyroid follicular cell destruction. While studies by Yu et al. (8) and Wei et al. (18) indicated that a prolonged Teff correlates with a higher incidence of post-therapeutic hypothyroidism, conflicting evidence exists in the literature. Other investigators (19, 20) observed no statistically significant difference in Teff between patients achieving CR and those with NCR following RAI therapy.
Our findings, consistent with prior investigations (9, 21, 22), demonstrate a significant association between elevated RAIU24h and favorable therapeutic outcomes following initial RAI therapy for GH disease. This underscores its fundamental role in thyroidal dosimetry, as RAIU24h directly influences the absorbed radiation dose delivered to the thyroid tissue. However, this positive association contrasts with studies reporting an inverse correlation between RAIU24h and cure rates in GH disease, suggesting reduced efficacy at higher RAIU24h uptake levels (16, 23, 24). This discrepancy may arise from confounding factors inadequately controlled in earlier analyses. Proposed mechanisms include potential inherent radio resistance in thyroid tissue exhibiting higher RAIU24h uptake levels (25), and the critical impact of dosimetry methodology. Within various RAI dosimetric formulae, prescribed RAI activity is typically inversely proportional to RAIU24h values; consequently, subtherapeutic radiation doses derived from such calculations may precipitate treatment failure despite elevated RAIU24h (26, 27), potentially masking the true biological significance of RAIU24h. Notwithstanding these complexities and conflicting reports, our robust statistical analysis controlling for key variables strongly supports the conclusion that RAIU24h constitutes an independent risk factor for RAI therapy efficacy.
Markovic et al. (28) demonstrated significantly superior therapeutic outcomes in GH patients with thyroid glands <62 g (treatment failure rate: 9.6%) compared to those with glands >62 g (persistent hyperthyroidism rate: 44%). This volumetric dependence is further reinforced by Szumowski et al. (29), who demonstrated a significant inverse correlation between thyroid gland volume and therapeutic efficacy (P<0.002), with larger volumes predicting diminished ablative success. Consistent with existing literatures (12, 20, 28, 29), our study confirmed TM as an independent prognostic factor influencing RAI therapy outcomes in GH patients, with increasing TM demonstrating a significant inverse correlation with RAI efficacy. Collectively, these observations substantiate the imperative for rigorous pre-therapeutic assessment in GH patients undergoing RAI therapy, employing quantitative 99mTc-pertechnetate thyroid scan or ultrasonography. This evaluation is critical for accurate glandular volumetry and functional characterization, thereby enabling optimized stratification of the administered RAI activity to achieve therapeutic objectives.
RAI therapy ablates thyroid tissue via β-ray-induced follicular cell necrosis, reducing thyroid hormone synthesis (5). In the present study, the significantly greater reduction in ΔFT3 and ΔFT4 in CR versus NCR groups at 1-month post-therapy reflect the extent of follicular destruction, as FT3 and FT4 kinetics are highly sensitive to thyroidal secretory activity. This phenomenon aligns with GH pathophysiology. Kagayama et al. (30). demonstrated that serum triiodothyronine (T3) concentrations in GH exceed thyroxine (T4) levels by >4-fold, attributable to intrathyroidal preferential T3 secretion—a pathognomonic feature of GH hyperfunction. Consequently, FT3 being independent of thyroxine-binding globulin variations, exhibits superior diagnostic sensitivity for GH. Kaplan et al. (31) further established that accelerated peripheral T4-to-T3 deiodination contributes to elevated FT3 concentrations, while FT3’s 3–4-fold higher biological potency relative to FT4 underscores its critical role in evaluating GH severity. Our multivariate analysis revealed ΔFT3 as an independent predictor of treatment response after adjusting for gender, age, and TRAb levels, confirming its utility in early therapeutic assessment.
The selection of ΔFT3 over ΔFT4 in the final predictive model was guided by the pathophysiology of GH and statistical findings. The characteristic T3-dominant secretory profile of Graves’ hyperthyroidism renders ΔFT3 a more direct biomarker of the early therapeutic effect of RAI on thyroid hormone secretion. Furthermore, in multivariate analysis, ΔFT4 did not retain independent predictive value, indicating that its information was encompassed by other variables, including ΔFT3.
The primary clinical utility of ΔFT3, as identified in our study, lies in early outcome prediction and patient management after the initial RAI therapy. By measuring the magnitude of the FT3 decrease at one month, clinicians can use our nomogram to identify patients with a high probability of requiring additional therapy (e.g., a second RAI treatment or anti-thyroid drugs) at a very early stage. This enables timely clinical decision-making, avoiding prolonged waiting periods to confirm treatment failure based solely on thyroid function tests at 3 or 6 months.
Multivariate logistic regression identified four independent predictors of NCR following initial RAI therapy for GH disease: RAIU24h, Teff, TM, and ΔFT3 (Table 3). To enhance prognostic precision beyond individual parameters, we developed a composite nomogram integrating these variables (Figure 4). Validation through ROC analysis demonstrated significantly superior discrimination for the nomogram versus any single predictor in both training and validation cohorts (Figures 6A, B). The model maintained excellent calibration fidelity and provided positive net benefit across clinical decision thresholds on DCA, supporting its utility for personalized RAI activity prescription. Implementation may optimize therapeutic dosing to reduce retreatment requirements while maintaining remission rates.
However, this study has several limitations requiring acknowledgment. First, the retrospective cohort design inherently carries risks of selection bias. Second, the prognostic nomogram underwent only internal validation without external cohort verification. Third, therapeutic outcomes were assessed solely at short-term follow-up (6–12 months), lacking longitudinal data on relapse rates beyond 24 months. Prospective multicenter studies with larger cohorts are warranted to externally validate this model and evaluate its impact on long-term remission sustainability.
5 Conclusion
This study establishes a clinically robust prognostic model for predicting NCR following initial RAI therapy in GH patients. Multivariate analysis identified four independent predictors of treatment failure: TM, RAIU24h, Teff and ΔFT3. The integration of these parameters into a prognostic nomogram demonstrated superior discriminative performance compared to individual variables, with validation cohort results confirming its reliability. Collectively, this nomogram enables personalized RAI activity prescription by stratifying patients at high risk for NCR. Its implementation may optimize therapeutic dosing strategies, reduce retreatment requirements, and improve remission rates.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding authors.
Ethics statement
The studies involving humans were approved by Ethical Committee of the Affiliated Hospital of Qingdao University (Ethics approval number: QYFY WZLL 30462). The studies were conducted in accordance with the local legislation and institutional requirements. The ethics committee/institutional review board waived the requirement of written informed consent for participation from the participants or the participants' legal guardians/next of kin because this study is a retrospective study.
Author contributions
CW: Investigation, Writing – review & editing, Data curation, Formal analysis, Writing – original draft, Conceptualization, Funding acquisition. YTL: Investigation, Formal analysis, Methodology, Writing – original draft, Conceptualization. GQ: Conceptualization, Methodology, Formal analysis, Writing – original draft, Data curation. YS: Writing – original draft, Methodology, Investigation, Conceptualization. XY: Data curation, Writing – review & editing, Methodology, Conceptualization. YQL: Methodology, Formal analysis, Investigation, Writing – review & editing, Data curation, Conceptualization. XW: Methodology, Formal analysis, Software, Writing – review & editing, Conceptualization, Project administration.
Funding
The author(s) declare financial support was received for the research and/or publication of this article. The present study was supported by grants from the Shandong Provincial Natural Science Foundation (grant no. ZR2022QH127).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Nilsson JN, Elovsson R, Thor D, Calissendorff J, and Ardenfors O. Radioiodine treatment outcome by dosimetric parameters and renal function in hyperthyroidism. Thyroid Res. (2022) 15:8. doi: 10.1186/s13044-022-00126-4
2. Taylor PN, Albrecht D, Scholz A, Gutierrez-Buey G, Lazarus JH, Dayan CM, et al. Global epidemiology of hyperthyroidism and hypothyroidism. Nat Rev Endocrinol. (2018) 14:301–16. doi: 10.1038/nrendo.2018.18
3. Wiersinga WM, Poppe KG, and Effraimidis G. Hyperthyroidism: aetiology, pathogenesis, diagnosis, management, complications, and prognosis. Lancet Diabetes Endocrinol. (2023) 11:282–98. doi: 10.1016/S2213-8587(23)00005-0
4. Kahaly GJ, Bartalena L, Hegedüs L, Leenhardt L, Poppe K, and Pearce SH. 2018 european thyroid association guideline for the management of graves’ Hyperthyroidism. Eur Thyroid J. (2018) 7:167–86. doi: 10.1159/000490384
5. Ross DS, Burch HB, Cooper DS, Greenlee MC, Laurberg P, Maia AL, et al. 2016 american thyroid association guidelines for diagnosis and management of hyperthyroidism and other causes of thyrotoxicosis. Thyroid. (2016) 26:1343–421. doi: 10.1089/thy.2016.0229
6. Bahn RS, Burch HB, Cooper DS, Garber JR, Greenlee MC, Klein I, et al. Hyperthyroidism and other causes of thyrotoxicosis: management guidelines of the American Thyroid Association and American Association of Clinical Endocrinologists. Endocr Pract. (2011) 17:456–520. doi: 10.4158/ep.17.3.456
7. Burch HB, Burman KD, and Cooper DS. A 2011 survey of clinical practice patterns in the management of Graves’ disease. J Clin Endocrinol Metab. (2012) 97:4549–58. doi: 10.1210/jc.2012-2802
8. Yu F, Zhang R, Zhang G, Meng Z, Liu X, He Y, et al. Predictive value of a thyroid-absorbed dose with a shorter effective half-life on efficacy in graves disease patients receiving iodine-131 therapy. Med Sci Monit. (2021) 27:e928796. doi: 10.12659/MSM.928796
9. Yu F, Ma W, Li X, Zhang R, Kang F, Yang W, et al. Nomogram construction and evaluation for predicting non-remission after a single radioactive iodine therapy for Graves’ hyperthyroidism: a retrospective cohort study. Front Endocrinol (Lausanne). (2024) 15:1391014. doi: 10.3389/fendo.2024.1391014
10. Hoang TD, Stocker DJ, Chou EL, and Burch HB. 2022 update on clinical management of graves disease and thyroid eye disease. Endocrinol Metab Clin North Am. (2022) 51:287–304. doi: 10.1016/j.ecl.2021.12.004
11. Liao K, Wei X, Chen Y, Meng D, Mo S, Sun Z, et al. Development and validation of a nomogram prediction model for factors influencing 131I-refractory Graves’ hyperthyroidism. Front Endocrinol (Lausanne). (2025) 16:1628226. doi: 10.3389/fendo.2025.1628226
12. Iwanaga H, Fujita N, Abe S, Ito R, Abe Y, Minamimoto R, et al. Predictive factors for the efficacy of radioiodine therapy in patients with Graves’ disease. Nucl Med Commun. (2025). doi: 10.1097/MNM.0000000000002045 [Epub ahead of print].
13. Medicine CSoN. Clinical guidelines for 131I treatment of Graves’ hyperthyroidism (2021 edition). Chin J Nucl Med Mol Imaging. (2021) 41:242–53. doi: 10.3760/cma.j.cn321828-20201109-00405
14. Wang H, Yu D, Tan Z, Hu R, Zhang B, and Yu J. Estimation of thyroid volume from scintigraphy through 2D/3D registration of a statistical shape model. Phys Med Biol. (2019) 64:095015. doi: 10.1088/1361-6560/ab186d
15. Leslie WD, Ward L, Salamon EA, Ludwig S, Rowe RC, and Cowden EA. A randomized comparison of radioiodine doses in graves’ Hyperthyroidism. J Clin Endocrinol Metab. (2003) 88:978–83. doi: 10.1210/jc.2002-020805
16. Liu M, Jing D, Hu J, and Yin S. Predictive factors of outcomes in personalized radioactive iodine ((131)I) treatment for Graves’ disease. Am J Med Sci. (2014) 348:288–93. doi: 10.1097/MAJ.0000000000000288
17. Bonnema SJ and Hegedüs L. Radioiodine therapy in benign thyroid diseases: effects, side effects, and factors affecting therapeutic outcome. Endocr Rev. (2012) 33:920–80. doi: 10.1210/er.2012-1030
18. Zheng W, Jian T, Guizhi Z, Zhaowei M, and Renfei W. Analysis of 131I therapy and correlation factors of Graves’ disease patients: a 4-year retrospective study. Nucl Med Commun. (2012) 33:97–101. doi: 10.1097/MNM.0b013e32834d3bb9
19. Sabri O, Zimny M, Schreckenberger M, Reinartz P, Ostwald E, and Buell U. Radioiodine therapy in Graves’ disease patients with large diffuse goiters treated with or without carbimazole at the time of radioiodine therapy. Thyroid. (1999) 9:1181–8. doi: 10.1089/thy.1999.9.1181
20. Yang D, Xue J, Ma W, Liu F, Fan Y, Rong J, et al. Prognostic factor analysis in 325 patients with Graves’ disease treated with radioiodine therapy. Nucl Med Commun. (2018) 39:16–21. doi: 10.1097/MNM.0000000000000770
21. Xing YZ, Zhang K, and Jin G. Predictive factors for the outcomes of Graves’ disease patients with radioactive iodine (131I) treatment. Biosci Rep. (2020) 40:BSR20191609. doi: 10.1042/BSR20191609
22. Nordyke RA and Gilbert FI Jr. Optimal iodine-131 dose for eliminating hyperthyroidism in Graves’ disease. J Nucl Med. (1991) 32:411–6.
23. Kristoffersen US, Hesse B, Rasmussen AK, and Kjaer A. Radioiodine therapy in hyperthyroid disease: poorer outcome in patients with high 24 hours radioiodine uptake. Clin Physiol Funct Imaging. (2006) 26:167–70. doi: 10.1111/j.1475-097X.2006.00666.x
24. de Jong JA, Verkooijen HM, Valk GD, Zelissen PM, and de Keizer B. High failure rates after (131)I therapy in Graves hyperthyroidism patients with large thyroid volumes, high iodine uptake, and high iodine turnover. Clin Nucl Med. (2013) 38:401–6. doi: 10.1097/RLU.0b013e3182817c78
25. Di Martino F, Traino AC, Brill AB, Stabin MG, and Lazzer M. A theoretical model for prescription of the patient-specific therapeutic activity for radioiodine therapy of Graves’ disease. Phys Med Biol. (2002) 47:1493–9. doi: 10.1088/0031-9155/47/9/305
26. Sabri O, Zimny M, Schreckenberger M, Meyer-Oelmann A, Reinartz P, and Buell U. Does thyroid stunning exist? A model with benign thyroid disease. Eur J Nucl Med. (2000) 27:1591–7. doi: 10.1007/s002590000348
27. Happel C, Kranert WT, Ackermann H, Binse I, Bockisch B, Gröner D, et al. Thyroid stunning in radioiodine-131 therapy of benign thyroid diseases. Endocrine. (2019) 63:537–44. doi: 10.1007/s12020-018-01833-5
28. Moura-Neto A, Mosci C, Santos AO, Amorim BJ, de Lima MC, Etchebehere EC, et al. Predictive factors of failure in a fixed 15 mCi 131I-iodide therapy for Graves’ disease. Clin Nucl Med. (2012) 37:550–4. doi: 10.1097/RLU.0b013e31824851d1
29. Szumowski P, Abdelrazek S, Kociura Sawicka A, Mojsak M, Kostecki J, Sykała M, et al. Radioiodine therapy for Graves’ disease - retrospective analysis of efficacy factors. Endokrynol Pol. (2015) 66:126–31. doi: 10.5603/EP.2015.0019
30. Kageyama K, Kinoshita N, and Daimon M. A case of thyrotoxicosis due to simultaneous occurrence of subacute thyroiditis and graves’ Disease. Case Rep Endocrinol. (2018) 2018:3210317. doi: 10.1155/2018/3210317
Keywords: Graves’ hyperthyroidism, radioiodine therapy, free triiodothyronine change, non-complete remission, nomogram model
Citation: Wang C, Li Y, Qin G, Song Y, Yang X, Lu Y and Wang X (2025) A prognostic nomogram model for non-complete remission following initial radioiodine therapy in Graves’ hyperthyroidism. Front. Endocrinol. 16:1692702. doi: 10.3389/fendo.2025.1692702
Received: 03 September 2025; Accepted: 15 October 2025;
Published: 28 October 2025.
Edited by:
Joao Dts Anselmo, Hospital do Divino Espírito Santo, PortugalReviewed by:
Sarbjeet Makkar, University of Michigan, United StatesKatsuhiko Kato, Nagoya University, Japan
Lu Lu, People’s Hospital of Guangxi Zhuang Autonomous Region, China
Cyril Fersing, Institut du Cancer de Montpellier (ICM), France
Andreia Pataco, Hospital Divino Espírito Santo, Portugal
Copyright © 2025 Wang, Li, Qin, Song, Yang, Lu and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xufu Wang, d2FuZ3h1ZnVAc2luYS5jb20=; Yaqi Lu, MTc4NTMyOTcxMjlAMTYzLmNvbQ==
†These authors have contributed equally to this work
‡ORCID: Congcong Wang, orcid.org/0000-0003-1793-3301
Xufu Wang, orcid.org/0000-0002-0849-5429
Yanhui Song1