ORIGINAL RESEARCH article

Front. Endocrinol., 24 March 2023

Sec. Pediatric Endocrinology

Volume 14 - 2023 | https://doi.org/10.3389/fendo.2023.1130580

A comprehensive validation study of the latest version of BoneXpert on a large cohort of Caucasian children and adolescents

  • 1. Department of Pediatrics, Second Faculty of Medicine, Charles University and Motol University Hospital, Prague, Czechia

  • 2. Department of Anthropology and Human Genetics, Faculty of Science, Charles University, Prague, Czechia

  • 3. Department of Probability and Mathematical Statistics, Faculty of Mathematics and Physic, Charles University, Prague, Czechia

Abstract

Introduction:

Automated bone age assessment has recently become increasingly popular. The aim of this study was to assess the agreement between automated and manual evaluation of bone age using the method according to Tanner-Whitehouse (TW3) and Greulich-Pyle (GP).

Methods:

We evaluated 1285 bone age scans from 1202 children (657 scans from 612 boys) by using both manual and automated (TW3 as well as GP) bone age assessment. BoneXpert software versions 2.4.5.1. (BX2) and 3.2.1. (BX3) (Visiana, Holte, Denmark) were compared with manual evaluation using root mean squared error (RMSE) analysis.

Results:

RMSE for BX2 was 0.57 and 0.55 years in boys and 0.72 and 0.59 years in girls, respectively for TW3 and GP. For BX3, RMSE was 0.51 and 0.68 years in boys and 0.49 and 0.52 years in girls, respectively for TW3 and GP. Sex- and age-specific analysis for BX2 identified the largest differences between manual and automated TW3 evaluation in girls between 6-7, 12-13, 13-14 and 14-15 years, with RMSE 0.88, 0.81, 0.92 and 0.84 years, respectively. The BX3 version showed better agreement with manual TW3 evaluation (RMSE 0.64, 0.45, 0.46 and 0.57).

Conclusion:

The latest version of the BoneXpert software provides improved and clinically sufficient agreement with manual bone age evaluation in children of both sexes compared to the previous version and may be used for routine bone age evaluation in non-selected cases in pediatric endocrinology care.

1 Introduction

The status of skeletal maturation is the most reliable indicator of biological age in children and adolescents. Bone age (BA) evaluation is a standard procedure widely used in children with growth failure and puberty disturbances. In addition, it is used in chronically ill patients as a complement to overall clinical health assessment (, ). BA is used successfully for the timing of orthopedic surgeries in children with uneven length of extremities or specific bone deformities as well.

An x-ray must comprise of the entire hand and wrist to be able to evaluate the bone age. The rationale for this lies in the fact that this skeletal site includes a large number of short bones of which the order and progression of ossification is very well known. Currently, the most common methods of evaluation are the Greulich and Pyle’s method (GP) published in 1959 () and the Tanner and Whitehouse 3 method (TW3), where the first edition was published in 1962 (). While the GP method evaluates the hand as a whole, the TW3 method assigns specific stages of skeletal maturation (1 through 9) to 13 individual pre-determined bones of the hand and wrist (the so-called Radius-Ulna-Short bones, RUS).

Although manual assessment of bone age using both the GP and TW3 methods is reliable if performed by a highly experienced specialist, its main disadvantage is the subjective nature of the procedure. The bone age result of two distinct expert raters may differ by up to a year (, ). Thus, automated methods of bone age assessment using software-based morphometric analysis of digitally acquired hand and wrist x-rays have been introduced to clinical practice in the last few years, aiming to eliminate the inherent subjective aspect of the manual work-up and save time. The most sophisticated and currently widely used method of automated bone age analysis works on the platform of the BoneXpert software developed by Visiana (Holte, Denmark). In brief, the software delineates the distal epiphyses of the radius, ulna, metacarpals and phalanges. At least eight bones need to be scored to compute bone age (). Detailed functioning of the software has been described previously (, ).

While the first two commercially released versions of the software already underwent validation with real clinical cases (, ), the latest release (issued in 2020) that aimed to improve the limitations of former versions, has not yet been independently tested.

The aims of this study were: 1) to compare manual and automated bone age assessment using BoneXpert software versions BX2 and BX3 using both the GP and TW3 methods in a large cohort of children with various disorders, ages, and sexes, 2) to explore whether the TW3 bone age outcome is affected by differences in the evaluation of individual bones between manual and automated methods.

2 Participants and methods

2.1 Participants

This cross-sectional retrospective study included 1285 radiographs from 1202 non-selected children and adolescents aged 5 to 16 years (657 scans in 612 boys and 628 scans in 590 girls). All radiographs done for the purpose of bone age assessment at Motol University Hospital between January 2018 and January 2019 were collected. Patients with an abnormal bone structure (e.g. skeletal dysplasia) and patients of non-Caucasian ethnicity1 were excluded from the analysis. The software rejected 8 images for poor quality or having an incorrect hand position. Sex-specific one-year age categories were created for girls between 5 and 15 years and boys between 5 and 16 years. Each one-year category included a minimum of 50 radiographs.

This study was approved by the Ethics Committee of the Motol University Hospital (Reference No.: EK-264/18) and complied with the Declaration of Helsinki.

2.2 Bone age assessment

After the bone age scan of the left hand and wrist was taken, each image was evaluated manually by one of two experienced raters (M.K. or Z.D.) using both the TW3 () and GP () methods (only patients sex was disclosed, chronological age was calculated after bone age assessment, diagnosis was not provided to the rater). All images were sent in a standard DICOM format (Digital Imaging and Communications in Medicine) for evaluation using automated bone age assessment software BoneXpert (Visiana, Holte, Denmark). No post-processing was applied to the x-rays. The software input consists of patient’s sex, birth date and date of x-ray scan. The BX2 version was used for the purpose of clinical practice, the same images were then reevaluated by the BX3 version as well. This was used only for the purpose of validating the program (the BX3 version was kindly provided by Visiana in form of a StandAlone program for independent evaluation).

If the absolute difference between the manual and automated bone age assessment was more than 1.5 year (an arbitrarily set cut-off in either the GP or TW3 method) the images were reevaluated by an experienced independent rater (S.P.), a medical anthropologist with no affiliation to the Motol University Hospital. An average of the two manual assessments was used for statistical analysis in these cases (N = 70).

2.3 Statistical analysis

Throughout the analysis, repeated measurements on the same child were treated as independent observations as they were gained at different visits.

The Bland-Altman analysis was used to determine the character of differences between the automated and manual approach. For each patient, Bland-Altman plots the difference between the automated and manual assessment against the mean of the two methods, or alternatively, against the values of one of the two methods. In this analysis, differences were plotted against the results of the manual method. The graphs indicate where the automated method produced higher or lower values in comparison to the manual method, possible bias (mean of differences) and lower and upper limit of accuracy (LOA), computed as bias ± 2×standard deviation (SD). Bias of each method was tested using a one-sample t-test, the bias between BX2 and BX3 were compared using paired t-tests.

To explore the size of differences between manual and automated bone age assessment in general and in various categories (defined by sex and/or age and diagnosis), Root Mean Squared Errors (RMSE) were calculated using the standard formula ():

where:

  • Σ = summation

  • (zfi-zoi)2 = differences, squared

  • N = sample size

Confidence intervals for RMSE were computed under the assumption of symmetry of deviations of BoneXpert estimates compared to manual assessment. Accuracy of BX2 with respect to manual assessment was compared to the accuracy of BX3 with respect to manual assessment using the Diebold-Mariano test ().

In the detailed analysis of the TW3 method, the difference between stages assigned by manual and automated method were compared using ANOVA F-test and post-hoc pairwise comparisons with Benjamini-Hochberg correction for multiple comparisons. The differences in assigned bone stages were tested in all available scans divided into 3 groups according to the difference in the final bone age (BX higher than manual by >1.0 year; BX lower than manual by > 1.0 year; BX not different from manual, i.e.<1.0 and > - 1.0 year). In bones showing the greatest differences in assigned bone stages, the effect on resulting bone age was tested.

All analyses were performed in statistical language and environment R, version 4.1.2 (). The level of statistical significance was set to 0.05 throughout the analysis. In case of multiple comparisons adjustment (such as testing in various age-, sex- or diagnosis-specific categories), the Benjamini-Hochberg method was used.

3 Results

3.1 Comparison between automated and manual bone age assessment in children according to sex and age

Using the TW3 method, the BX2 version generally underestimated bone age in both sexes, whereas the BX3 version performed comparably to the manual assessment with mean of the differences close to zero (Table 1 – the data are given in years). On the other hand, BX3 performed significantly worse using the GP method compared to BX2 version in boys (Table 1). In particular, while BX2-assessed GP bone age did not differ from manually assessed GP bone age in boys, the BX3 version significantly overestimated GP bone ages. In girls, both BX2 and BX3 slightly underestimated GP bone age compared to manual evaluation.

Table 1

TW3GP
NBX2 – MANBX3 – MANBX2 – MANBX3 – MAN
mean (SD)Pmean (SD)Pmean (SD)pmean (SD)P
Boys657-0.19 (0.54)< 0.0001-0.01 (0.51)0.239-0.00 (0.55)0.9240.39 (0.56)< 0.0001
Girls628-0.47 (0.55)< 0.0001-0.02 (0.49)0.635-0.23 (0.55)< 0.0001-0.10 (0.51)< 0.0001

Overall means of differences in years between automated and manual bone age assessment, separately for both sexes and software versions (BX2 and BX3).

P-values for one-sample t-test examining the difference from zero.

TW3, bone age assessment according to Tanner-Whitehouse 3 method; GP, bone age assessment according to Greulich-Pyle method; BX2, BoneXpert version 2.4.5.1.; BX3, BoneXpert version 3.0.3.; MAN, manual bone age assessment.

The differences between automated and manual bone age results are presented in detail in Bland-Altman graphs in Figure 1. The best agreement was observed in the BX3 version using the TW3 method in both sexes (Figure 1B).

Figure 1

These findings were further supported by the RMSE analysis showing that the BX3 version has significantly better agreement with manual bone age assessment than the BX2 version in both sexes using the TW3 method and in girls using the GP method as well (Table 2 - the data are given in years). In contrast, the BX3 version performed worse than BX2 in boys using the GP method.

Table 2

TW3GP
NBX2 vs MANBX3 vs MANpBX2 vs MANBX3 vs MANP
Boys6570.57 (0.54-0.61)0.51 (0.48-0.54)*0.00070.55 (0.52-0.58)0.68 (0.64-0.72) #< 0.0001
Girls6280.72 (0.69-0.77)0.49 (0.47-0.52)*< 0.00010.59 (0.56-0.63)0.52 (0.49-0.55)*< 0.0001

Root mean square errors of automated vs. manual bone age assessment, separately for both sexes and software versions (BX2 and BX3).

Root mean square errors (and corresponding 95% confidence intervals) are shown (in years).

p-value: Diebold-Mariano test for method accuracy (* BX3 performs significantly better than BX2 α= 0.05, # BX3 performs significantly worse than BX2 at α =0.05).

TW3, bone age assessment according to Tanner-Whitehouse 3 method, GP, bone age assessment according to Greulich-Pyle method, BX2, BoneXpert version 2.4.5.1., BX3, BoneXpert version 3.0.3., MAN, manual bone age assessment.

Sex- and age-specific RMSE for the BX2 version using the TW3 method showed that the largest differences between automated and manual bone age were present in girls aged 6-7 and 12-15 years (Figure 2). When using the BX3 version, the agreement between automated and manual bone age improved significantly in 8/10 age categories in girls, when compared to BX2. For the GP method, BX2 showed significantly larger RMSE than the BX3 version only in girls aged 7-8 years.

Figure 2

In boys, the BX3 version showed improvement of the TW3 method in 4 age categories (9-10, 11-12, 13-14 and 15-16 years), compared to BX2 (Figure 2). In contrast, the RMSEs between manual and automated bone age evaluation were larger when using the BX3 version compared to BX2 using the GP method in boys, in particular for ages 6-8 and 9-10 years. The RMSE numeric values (in years) are presented in Supplementary Table 1.

The absolute difference in bone age result > 1.0 year was noted in 7.5% and 6.2% scans in boys and 16.4% and 8.4% scans in girls, for TW3 and GP respectively, when using the BX2 version. The BX3 version showed > 1.0 year difference in 6.3% and 12.8% scans in boys and 6.0% and 5.3% scans in girls for TW3 and GP, respectively.

3.2 Agreement between automated and manual bone age assessment in children with various diagnoses

The RMSE analysis confirmed that the best agreement between automated and manual bone age evaluation was reached when using the TW3 method in BX3, regardless of the patient’s disease (Figure 3). Disease-specific RMSEs are shown in Supplementary Table 2.

Figure 3

The disease specific mean differences between automated and manual bone age values showed that the TW3 BX2 bone age differed significantly from manual evaluation in 16/24 disease groups. BX3 showed significant improvement, only children with growth hormone deficiency differed significantly from manual testing. The particular differences given in years are shown in Supplementary Figure 1.

3.3 Detailed analysis of the TW3 method: Differences of the automated and manual evaluation of particular bones and the effect on the outcome of the final bone age

A detailed analysis of the TW3 method was carried out on 1206 scans with detailed data on individual bones available. Out off these, 145 BX2 assessments (12.0%) differed by more than 1 year from the manual assessment, most of these (139) being lower than the manually estimated bone age. Seventy-four BX3 assessments (6.1%) differed by more than 1 year from the manual assessment (while being much more equally distributed: 47 were lower and 27 higher than the manually assessed bone age).

For each automated bone age software version and each group according to whether automated assessment resulted in the bone age being 1) > 1.0 year higher, 2) >1.0 year lower, or 3) less than one year different from the manually assessed bone age, differences in individual bone scores for each of the 13 bones were examined graphically (Supplementary Figure 2) and by using the ANOVA method with post-hoc pairwise comparisons. Out of these radius and ulna showed larger differences in assigned bone score among other bones (ANOVA F-test p< 0.001).

While focusing only on those x-rays where the ulna and/or radius scoring differed by more than 1 stage between automated and manual assessment,we have identified 90 such scans for the ulna with the BX2 version (85 underestimated and 5 overestimated scores) and 42 scans with BX3 (24 underestimated and 18 overestimated scores). For the radius, there were only 7 and 0 cases for BX2 and 3 and 0 cases for BX3, with under- and overestimated scores, respectively. In scans where BX3 under- or over-estimated the evaluation of the ulna, the mean difference between the automated (BX3) and manual bone age deviated significantly from 0 (p< 0.001) however the mean difference did not exceed 1 year (Figure 4 and Supplementary Table 3). The absolute difference in bone age exceeded 1 year (N = 15; median absolute difference 1.2 years; IQR 1.1-1.3 years) only in a minority of these cases and there was no discernable pattern in sex or diagnoses.

Figure 4

4 Discussion

The objective of this study was to explore the clinical utility of the BoneXpert automated bone age assessment on a large unselected cohort of children. We showed that the latest BoneXpert version (BX3) performed comparably to expert manual bone age reading in a large cohort of Caucasian children and that it performed better than the previous BoneXpert version (BX2). In particular, BX2-inherent underestimation of TW3 bone age, which was more pronounced in girls, was completely abolished in the newer BX3 version. The TW3 bone age assessed by the BX3 performed best among myriad of diseases as well, in which bone age is typically evaluated. Thus, this study encourages the use of automated TW3 bone age assessment in daily clinical practice.

Validation of automated bone age assessment is typically done by comparing the result to bone age assessed manually by a highly experienced individual. We showed that the BX2 version underestimated TW3 bone age especially in girls aged 6 to 7 and 12 to 15 years, when compared to manually-assessed TW3 bone age. Our results were similar to a previous study in participants of the First Zurich Longitudinal Study, where the differences between automated and manual TW3 bone age assessment (RMSEs) were reported to be 0.67 years in boys and 0.63 years in girls (). The authors () noted considerable variability between individual age categories but did not show the data in extenso. Interestingly, our study showed that this inherent limitation of the BX2 version has been abolished in the latest software version (BX3).

There are no studies published comparing the TW3 bone age outcome between BX2 and BX3, only a single previous study explored the performance of the first (BX1) and third (BX3) software versions with regard to GP bone age (, ). In the Caucasian population a RMSE of 0.66 and 0.51 years in boys and 0.50 and 0.48 years in girls was reported, for BX1 and BX3 respectively. This was similar to our study, in which the BX3 version of GP bone age differed from the manual rating by 0.68 and 0.52 years in boys and girls respectively. Interestingly the GP results reported by Martin et al. () were in significantly worse agreement in girls of African descent (RMSE 0.75 years). On the other hand, a similar study on children of Indian ethnicity found the agreement between manual and automated GP bone age in girls to be 0.39 years (RMSE) (). As both GP and TW3 methods are based on the Caucasian population, the causes are probably the differences in skeletal maturation among different ethnicities, geographical location and socioeconomic status (, , ) - in the Czech Republic the agreement between sexual maturation and bone age provided by the GP and TW3 methods has been well established ().

To enhance clinical utility, automated bone age analysis needs proper validation in individual diseases. The BoneXpert software was introduced in 2009 () and the agreement of the first version with GP manual rating has been evaluated in children with a few common endocrine disorders (). Our study explored the agreement between automated and manual bone age assessment in a large unselected group of disorders that can be commonly encountered in pediatric clinical practice. We showed that the BX3 version TW3 method performs consistently across various disorders. Interestingly, the RMSE for the TW3 method of the BX3 version were lower than the RMSE for GP in the first version of the software () in children with growth hormone deficiency or Turner syndrome (0.50 vs. 0.71 and 0.48 vs. 0.75, respectively). These results further support the use of the latest TW3 BoneXpert version in clinical practice.

In every automated analysis algorithm, systemic scoring errors should be excluded to avoid improper bone age assessment. The automated TW3 assessment by BoneXpert displays the scoring of individual bones, which allows for a more in-depth analysis. We showed that automated ulna scoring resulted in larger differences from the manual scores compared to the other bones. However, this did not have a significant influence on the TW3 bone age value. This eliminates the possibility that the differences between automated and manual TW3 bone age values may be due to systemic errors in the evaluation of a particular bone.

The strengths of this study are: 1) the large cohort of patients of Caucasian descent with various disorders, representing the common clinical situation, in whom we validated the latest version of automated GP as well as TW3 bone age assessment provided by BoneXpert, 2) the direct comparison between the latest software version (BX3) and the previous widely used version (BX2) and 3) the in depth analysis of the TW3 method.

As a limitation of this study we recognize: 1) the homogeneous cohort of children with Caucasian descent, therefore we recommend caution when applying our results to the non-Caucasian population, 2) that the disease-specific RMSEs were not further analyzed with regard to sex. This was due to relatively low number of children in certain groups with rare disorders and because we found no statistically significant difference between boys and girls in the overall RMSE analysis of the TW3 BX3 version.

The strengths of BoneXpert software include: 1) time efficiency - the number of specialists that spent more than 2 minutes evaluating an image decreased from 86 to 21% after installation of BoneXpert (), 2) ease of use, 3) validation in different ethnicities () and various disorders (), and 4) wide use (). On the other hand 1) cost effectiveness in lower income countries may be an issue and 2) precision was not yet established.

5 Conclusion

Bone age analysis provided by the most recent BoneXpert software version showed clinically reliable agreement with manual evaluation among wide range of chronic diseases of children. BoneXpert is therefore a good alternative to manual rating. There are few relevant clinical implications for the use of BoneXpert in clinical practice. The major advantage is the ability to save time of the experienced evaluators. Manual bone age analysis could thus be reserved for cases where automated analysis performs improbably (i.e., discrepancy between bone age and sexual maturation) or is not feasible (i.e., skeletal dysplasia).On the other hand, bone morphology and structure, besides the bone age assessment, is routinely evaluated as part of the manual workup. The automated system does not provide such a feature. Thus, patients with mild to moderate skeletal dysplasia (which is clinically discrete) may escape the appropriate medical attention.

Statements

Data availability statement

All data generated and analyzed in this study are available from the corresponding author on reasonable request.

Ethics statement

The studies involving human participants were reviewed and approved by the Ethics Committee of the Motol University Hospital (Reference No.: EK-264/18). Written informed consent from the participants’ legal guardian/next of kin was not required to participate in this study in accordance with the national legislation and the institutional requirements.

Author contributions

KM and DZ contributed equally to the conception, design and data collection. PS performed the independent reevaluation of selected bone age scans. MP performed the statistical analysis. KM wrote the draft of the manuscript and ZS, OS, HK and SA were involved in data analysis and editing of the manuscript. All authors contributed to the article and approved the submitted version.

Funding

This study was supported by a grant from the Czech Ministry of Health (conceptual support project to research organization 00064203 - FN Motol).

Acknowledgments

We thank Hans Henrik Thodberg, the owner of the company that develops BoneXpert, for providing BoneXpert Stand-Alone software version 3.0.3. for the reevaluation of the images previously assessed in clinical practice by BoneXpert version 2.4.5.1.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2023.1130580/full#supplementary-material

Footnotes

1.^According to the 2021 census, the Czech population is homogeneous, the largest minority is of Vietnamese descent and makes up only 0.4% of the population ().

References

  • 1

    CohenPRogolADDealCLSaengerPReiterEORossJLet al. Consensus statement on the diagnosis and treatment of children with idiopathic short stature: A summary of the growth hormone research society, the Lawson Wilkins pediatric endocrine society, and the European society for paediatric endocrinology workshop. J Clin Endocrinol Metab (2008) 93(11):4210–7. doi: 10.1210/jc.2008-0509

  • 2

    Bangalore KrishnaKFuquaJSRogolADKleinKOPopovicJHoukCPet al. Use of gonadotropin-releasing hormone analogs in children: Update by an international consortium. Horm Res Paediatr (2019) 91(6):357–72. doi: 10.1159/000501336

  • 3

    GreulichWWPyleIS. Radiographic atlas of skeletal development of the hand and wrist. 2nd ed. Stanford: Stanford University Press (1959) 256.

  • 4

    TannerJMHealyMJRCameronNGoldsteinH. Assessment of skeletal maturity and prediction of adult height (TW3 method). 3rd ed. RussellD, editor. London: Harcourt Publishers Limited (2001). 110. Available at: https://books.google.cz/books?id=KKdxQgAACAAJ.

  • 5

    RocheAFRohmannCGFrenchNYDávilaGH. Effect of training on replicability of assessments of skeletal maturity (Greulich-pyle). Am J Roentgenol Radium Ther Nucl Med (1970) 108(3):511–5. doi: 10.2214/ajr.108.3.511

  • 6

    Van RijnRRThodbergHH. Bone age assessment: Automated techniques coming of age? Acta Radiol (2013) 54:1024–9. doi: 10.1258/ar.2012.120443

  • 7

    ThodbergHHKreiborgSJuulAPedersenKD. The BoneXpert method for automated determination of skeletal maturity. IEEE Trans Med Imaging (2009) 28(1):5266. doi: 10.1109/TMI.2008.926067

  • 8

    MartinDDCalderADRankeMBBinderGThodbergHH. Accuracy and self-validation of automated bone age determination. Sci Rep [Internet] (2022) 12(1):112. doi: 10.1038/s41598-022-10292-y

  • 9

    Rijn vanRRLequinMHThodbergHH. Automatic determination of greulich and pyle bone age in healthy Dutch children. Pediatr Radiol (2009) 39:591–7. doi: 10.1007/s00247-008-1090-8

  • 10

    ThodbergHHJenniOGRankeMBMartinDD. Standardization of the tanner-whitehouse bone age method in the context of automated image analysis. Ann Hum Biol (2012) 39(1):6875. doi: 10.3109/03014460.2011.642405

  • 11

    Czech Statistical Office. Census 2021 - ethicity. Census 2021 (2021). Available at: www.czso.cz/csu/scitani2021/ethnicity.

  • 12

    AvdeefA. Do you know your r2? ADMET DMPK (2021) 9(1):6974. doi: 10.5599/admet.888

  • 13

    DieboldFXMarianoRS. Comparing predictive accuracy. J Bus Econ Stat (1995) 13:253–63.

  • 14

    R Core Team. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing (2021). Available at: https://www.r-project.org/.

  • 15

    ThodbergHHSaL. Validation and reference values of automated bone age determination for four ethnicities. Academic Radiology (2010) 6):1425–32. doi: 10.1016/j.acra.2010.06.007

  • 16

    OzaCKhadilkarAVMondkarSGondhalekarKLadkatAShahNet al. A comparison of bone age assessments using automated and manual methods in children of Indian ethnicity. Pediatr Radiol (2022) 52(11):2188–96. doi: 10.1007/s00247-022-05516-2

  • 17

    WangYMTsaiTHHsuJSChaoMFWangYTJawTS. Automatic assessment of bone age in Taiwanese children: A comparison of the greulich and pyle method and the tanner and whitehouse 3 method. Kaohsiung J Med Sci (2020) 36(11):937–43. doi: 10.1002/kjm2.12268

  • 18

    BowdenJJBowdenSARuessLAdlerBHHuHKrishnamurthyRet al. Validation of automated bone age analysis from hand radiographs in a north American pediatric population. Pediatr Radiol (2022) 52(7):1347–55. doi: 10.1007/s00247-022-05310-0

  • 19

    KrasnicanovaHKuchynkovaI. New method of assessment of bone age TW3 and first results of its application in the Czech republic. Česko-slovenská Pediatr (2002) 57(2):62–5.

  • 20

    MartinDDMeisterKSchweizerRRankeMBThodbergHHBinderG. Validation of automatic bone age rating in children with precocious and early puberty. J Pediatr Endocrinol Metab (2011) 24:1009–14. doi: 10.1515/JPEM.2011.420

  • 21

    MartinDDHeilKHeckmannCZierlASchaeferJRankeMBet al. Validation of automatic bone age determination in children with congenital adrenal hyperplasia. Pediatr Radiol (2013) 43:1615–21. doi: 10.1007/s00247-013-2744-8

  • 22

    MartinDDDeuschDSchweizerRBinderGThodbergHHRankeMB. Clinical application of automated greulich-pyle bone age determination in children with short stature. Pediatr Radiol (2009) 39:598607. doi: 10.1007/s00247-008-1114-4

  • 23

    ThodbergHHThodbergBAhlkvistJOffiahAC. Autonomous artificial intelligence in pediatric radiology: The use and perception of BoneXpert for bone age assessment. Pediatr Radiol (2022) 52(7):1338–46. doi: 10.1007/s00247-022-05295-w

Summary

Keywords

bone age, Tanner-Whitehouse, Greulich-Pyle, BoneXpert, validation study

Citation

Maratova K, Zemkova D, Sedlak P, Pavlikova M, Amaratunga SA, Krasnicanova H, Soucek O and Sumnik Z (2023) A comprehensive validation study of the latest version of BoneXpert on a large cohort of Caucasian children and adolescents. Front. Endocrinol. 14:1130580. doi: 10.3389/fendo.2023.1130580

Received

23 December 2022

Accepted

16 February 2023

Published

24 March 2023

Volume

14 - 2023

Edited by

Gianluca Tornese, Institute for Maternal and Child Health Burlo Garofolo (IRCCS), Italy

Reviewed by

Gerdi Tuli, Regina Margherita Hospital, Italy; Roland Schweizer, University Children’s Hospital Tübingen, Germany; Alistair Duncan Calder, Great Ormond Street Hospital for Children NHS Foundation Trust, United Kingdom

Updates

Copyright

*Correspondence: Klara Maratova,

This article was submitted to Pediatric Endocrinology, a section of the journal Frontiers in Endocrinology

Disclaimer

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Outline

Figures

Cite article

Copy to clipboard


Export citation file


Share article

Article metrics