Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Plant Sci., 19 December 2025

Sec. Sustainable and Intelligent Phytoprotection

Volume 16 - 2025 | https://doi.org/10.3389/fpls.2025.1716546

This article is part of the Research TopicAccurate Measurement and Dynamic Monitoring of Forest ParametersView all 14 articles

Predicting individual tree diameter at breast height for genetically diverse Catalpa bungei using nonlinear mixed-effects models and UAV LiDAR data

Yang Zhang,&#x;Yang Zhang1,2†Miaomiao Zhang&#x;Miaomiao Zhang3†Qiao Chen,*Qiao Chen1,2*Liyong Fu,Liyong Fu1,2Wenjun MaWenjun Ma3Guangshuang DuanGuangshuang Duan4Xinru Fu,Xinru Fu1,2Ziyan Zheng,Ziyan Zheng1,2Chuangye WuChuangye Wu5Qingqing WangQingqing Wang5Yuheng ShunYuheng Shun5Pan LiPan Li5
  • 1Institute of Forest Resource Information, Chinese Academy of Forestry, Beijing, China
  • 2Key Laboratory of Forest Management and Growth Modelling, National Forestry and Grassland Administration, Beijing, China
  • 3State Key Laboratory of Tree Genetics and Breeding , Key Laboratory of Tree Breeding and Cultivation of State Forestry Administration, Research Institute of Forestry, Chinese Academy of Forestry, Beijing, China
  • 4Xinyang Normal University, Xinyang, Henan, China
  • 5Wen County Institute of Forestry Science, Jiaozuo, Henan, China

Introduction: Diameter at breast height (DBH) is a key parameter for assessing tree growth, carbon storage, and ecological functions. Traditional ground surveys are inefficient, labor-intensive, and terrain-limited, making them unsuitable for large-scale monitoring. Airborne LiDAR, as an advanced remote sensing tool, provides an efficient and non-destructive method for DBH estimation. However, most existing LiDAR-based models overlook the influence of genotype differences, limiting prediction accuracy.

Methods: In this study, we used data from 2,899 Catalpa bungei trees of different genotypes to develop a nonlinear mixed-effects (NLME) model that incorporates genotype as a random effect. This approach improved model generalizability by using LiDAR-derived tree height (LH) and LiDAR-derived crown diameter (LCD) as core predictors. Multiple sampling strategies were also evaluated to assess their impact on model performance.

Results: The results showed that, considering genotype effects, the proposed NLME model outperformed both traditional regression models and dummy-variable models (R2 = 0.8624, RMSE = 1.1330, TRE = 3.9555), demonstrating the important role of genotype differences in improving model accuracy. Random sampling further improved prediction accuracy while effectively reducing measurement costs.

Discussion: This research introduces a new framework for integrating genotype variability into DBH prediction models and offers valuable insights for future LiDAR-based studies in genetically heterogeneous plantations. The findings provide technical support for forest management and ecosystem monitoring, as well as a methodological foundation for predicting tree growth under varying site and genetic conditions.

1 Research background

Catalpa bungei is a high-value, native tree species in China, known for its timber, medicinal, and ornamental uses. It is primarily distributed in the middle and lower reaches of the Yellow River and the North and Central China regions, with recent introductions in southern and southwestern areas (Shi et al., 2025; Wang et al., 2016). The species exhibits rich genetic diversity, with germplasm resources spanning 19 provinces. Phenotypic and molecular marker studies have revealed significant intraspecific variation, providing a basis for the selection of superior clones (Xiao et al., 2019). Under the influence of climate warming, the potential suitable distribution zone of Catalpa bungei is shifting northward, indicating promising ecological restoration and afforestation value in northern and northeastern China (Shi et al., 2025).

Diameter at Breast Height(DBH) is one of the most fundamental attributes in forest ecology, providing a basis for estimating tree biomass, growth rates, and competitive interactions. It also serves as a proxy for ecological indicators such as forest productivity, structural complexity, and habitat quality. Therefore, improving DBH prediction accuracy is not only relevant to forestry practice but also directly contributes to more reliable ecological monitoring and indicator development.

Given the ecological significance of DBH, a thorough understanding of the intrinsic mechanisms driving its variation is essential for accurate modeling and ecological interpretation. The coordinated variation among crown diameter(CD), tree height(H), and DBH reflects the integrated physiological and biomechanical processes underlying tree growth. Crown expansion regulates light interception and photosynthetic capacity, thereby determining the amount of carbon assimilates available for cambial activity and secondary xylem development. A larger canopy enhances gross primary production (GPP) and promotes radial growth through greater carbon allocation to the stem (Li et al., 2014). As trees grow taller, the hydraulic path between roots and leaves lengthens, increasing water transport resistance and reducing leaf water potential; to maintain hydraulic efficiency, trees must increase sapwood area and conduit density, resulting in stem thickening (Ryan and Yoder, 1997; Mencuccini, 2003). Meanwhile, increasing height amplifies mechanical stress caused by wind and self-weight, requiring proportional increases in stem diameter to preserve structural stability and prevent bending or buckling (King, 2005). Collectively, these physiological, hydraulic, and mechanical constraints give rise to the allometric scaling relationships among H, CD, and DBH, representing an adaptive balance between carbon assimilation, water transport, and structural support in woody plants.

DBH is a critical indicator for assessing tree growth, carbon storage, and resource structure (Zhang et al., 2022; Wang et al., 2024; Ouyang et al., 2024; He et al., 2025), but traditional field measurements are time-consuming, labor-intensive, and severely constrained by topography. These limitations are especially problematic in large-scale forest monitoring. In recent years, the rapid development of remote sensing, particularly LiDAR technology, has offered an efficient and systematic solution for extracting forest structural information (Borsah et al., 2023; Balestra et al., 2024; De Conto et al., 2024), becoming a vital tool for large-scale DBH estimation. Airborne LiDAR can penetrate the forest canopy to accurately acquire three-dimensional structural parameters such as H and CD (Liu et al., 2020; Zhou et al., 2022). Compared with traditional methods, it is non-destructive, highly efficient, and suitable for complex terrains and dense forests (Dalponte et al., 2018; Oehmcke et al., 2022; Xiang et al., 2024).

Despite its advantages in structural parameter extraction, current studies using airborne LiDAR still face several challenges. Most DBH prediction models rely on fixed-effect assumptions and often overlook the potential impact of genotype variation and plot-specific conditions on tree growth. In stands with pronounced genetic heterogeneity, traditional models fail to capture the regulatory role of genotype in growth dynamics, limiting DBH estimation accuracy. Therefore, developing modeling approaches that integrate both fixed and random effects has become a critical pathway for improving DBH prediction accuracy.

In recent years, NLME models have been increasingly applied in forestry to describe the hierarchical structure of forest data and to improve prediction accuracy by accounting for both fixed and random sources of variation. They have been successfully used to model diverse forest attributes, including height–diameter relationships (Bronisz and Mehtätalo, 2020; Xu et al., 2024; Ismail et al., 2025), crown width and canopy structure (Raptis et al., 2018; Zhou et al., 2023), as well as stand biomass and growth dynamics (Qin et al., 2025; Zhu et al., 2025). These studies collectively demonstrate that incorporating stand-level or environmental variables—such as mean DBH, stand density, basal area, and climatic factors—as random effects or covariates can substantially enhance model flexibility and predictive power across forest types. The widespread adoption of NLME frameworks highlights their robustness and versatility for representing structural heterogeneity and complex ecological processes in both even- and uneven-aged forests.

Research on growth trait variation among Catalpa bungei genotypes has also progressed in recent years. A study systematically evaluated the growth performance of Catalpa bungei clones through multi-site trials. The results revealed significant genetic differences in DBH, H, and stem volume among genotypes, along with relatively high repeatability and moderate-to-high levels of genetic control. These findings provide a solid foundation for the selection and deployment of superior genotypes (Xiao et al., 2025). Additionally, Clonal trials of Paulownia species were conducted in both temperate and subtropical regions. The study found significant genotype-by-environment interactions in growth traits such as DBH and height, with genetic gains ranging from 18.05% to 46.03%, further confirming the critical influence of genotype on tree growth (Zhao et al., 2022).

Although NLME models have been widely applied in forestry modeling, most existing studies have treated plot, site, or stand characteristics as random effects. However, few studies have attempted to incorporate genotypic variation into the model structure to characterize individual growth variability and stochastic fluctuations from a genetic perspective. Previous DBH prediction models often overlooked the regulatory influence of genetic background on growth dynamics, resulting in limited prediction accuracy in stands with high genetic heterogeneity.

In this study, genotype was introduced as a random effect within the NLME framework for DBH modeling, enabling a systematic characterization of growth variation and random fluctuations among different genotypes. This approach substantially enhances the explanatory power and generalization ability of the model. The proposed innovation not only overcomes the limitations of conventional fixed-effect modeling but also provides a novel theoretical and technical framework for fine-scale DBH estimation and large-scale ecological monitoring in genetically diverse forest stands.

2 Materials and methods

2.1 Study area

The study area is located in Wenxian, Jiaozuo City, Henan Province, China (longitude 112°51’39”E–113°13’20”E, latitude 34°52’N–35°2’48”N), situated in the southwestern part of the North China Plain. The study area’s location and topographic features are illustrated in the map (Figure 1).The region lies at a relatively low elevation, ranging from 50 to 150 meters, with a terrain characterized by plains and low hills, gently sloping from northwest to southeast, forming a distinctive microclimate. The area has a temperate monsoon climate, with an average annual temperature of approximately 14 °C to 15 °C and annual precipitation between 600 and 700 mm. The predominant soil type is yellow-brown soil, which has a moderate texture and good drainage, suitable for the growth of various plant species. Wenxian, located in northwestern Henan Province and near the Yellow River, offers favorable natural conditions for ecological research and sustainable development.

Figure 1
Map and aerial image showing Wen County in Henan Province, China. The map highlights the study area with red, Wen County in yellow, Henan Province in pink, and China in light blue. An aerial image details the study area, delineating three plots in red, green, and blue. Legends and scale bars are included for reference.

Figure 1. Overview of the study area.

To investigate the growth characteristics of Catalpa bungei genotypes, three different treatment plots were established in the study area (Figure 2): Plot 1: High-density planting with standard fertilization, spacing 2 m × 2 m, arranged in 120 rows × 10 columns. A total of 200 genotypes were included, with two trees per genotype, planted in triplicate. Plot 2: Control group with standard fertilization, spacing 4 m × 4 m, arranged in 60 rows × 20 columns. A total of 200 genotypes, two trees per genotype, planted in triplicate. Plot 3: Nitrogen-deficient treatment with spacing 4 m × 4 m, 60 rows × 20 columns, also with 200 genotypes, two trees per genotype, and three replications.

Figure 2
Three experimental plots are illustrated. Plot 1: high density, fertilized, 18 meters by 238 meters, with identical genotypes. Plot 2: normal density, fertilized, 76 meters by 236 meters, with different genotypes. Plot 3: normal density, nitrogen-deficient, 76 meters by 236 meters. Each plot features colored dots representing genetic variation.

Figure 2. Schematic diagram of the Catalpa bungei treatment plots.

2.2 Data collection and preprocessing

Ground-based data were collected during 2021, 2023, and 2024. The measurement instruments used for DBH, H, and CD include DBH calipers, telescopic height poles, and laser hypsometers. The specifications, precision, and resolution of the instruments are shown in Table 1. Within each plot, all upright and living Catalpa trees were measured for: DBH,H and CD. Representative vegetation types in the sample plots are shown in the field photographs (Figure 3). Table 2 presents the descriptive statistics of the measured tree variables.

Table 1
www.frontiersin.org

Table 1. Specifications and accuracy of ground-based measurement instruments.

Figure 3
A grove of trees with white trunks stands in neat, evenly spaced rows. The ground is covered with fallen leaves and patches of sunlight filter through the green canopy above.

Figure 3. Schematic diagram of the Catalpa bungei treatment plots.

Table 2
www.frontiersin.org

Table 2. Descriptive statistics of tree variables (SD, standard deviation; DBH, diameter at breast height; H, measured tree height; CD, measured crown width; LH, LiDAR-derived tree height; LCD, LiDAR-derived crown diameter).

UAV LiDAR data were also collected in 2021, 2023, and 2024 using the BB-4 Bumblebee UAV platform. The onboard sensor was the AlphaUni AS-1300HL long-range multi-platform LiDAR system by CHCNAV. This system uses a 1550 nm Class 1 eye-safe laser, supports 360°full-scan mode, with a maximum range of 1845 m (for targets with >80% reflectivity), a maximum pulse rate of 1.5 MHz, single-shot precision of 15 mm, and repeatability of 10 mm. The system weighs 4.5 kg, consumes 65W of power, and can operate stably in temperatures from −10 °C to +40 °C. It is equipped with an integrated GNSS/IMU (200 Hz) for high-precision positioning and orientation, meeting the requirements for 3D forest point cloud acquisition. Four flights were conducted at an average flight altitude of 80 m and an average speed of 6 m/s, yielding a point cloud density of approximately 350 pts/m2. The research workflow included site reconnaissance, ground control point deployment and measurement, flight route planning and LiDAR data acquisition, and indoor data processing and analysis. Based on preliminary data collection and field surveys, the number, H, DBH, and CD of Catalpa trees in the plots were measured. Genotypes with fewer than 30 sample trees were excluded, and samples with both LiDAR-derived tree height (LH or LH) and measured H less than 1.3 m were removed. Ultimately, 78 genotypes comprising a total of 2,899 Catalpa bungei trees were retained for analysis.

DBH, also known as stem diameter, is one of the most fundamental variables in tree measurement. In this study, Catalpa bungei was selected as the target species, and the DBH measured at 1.3 m above the ground was used as the standard DBH for individual trees. A total of 2,899 Catalpa bungei trees in the study area were measured for DBH. The summary statistics of UAV LiDAR-derived individual tree parameters—LiDAR height (LH or LH) and LiDAR-derived crown diameter (LCD or LCD)—are presented in Table 2.

Figure 4 shows the correlations between LiDAR-derived and field-measured parameters. LH exhibited an excellent linear relationship with H (R2 = 0.9232, p< 0.0001), indicating that UAV LiDAR accurately captured vertical structural information. Similarly, the LCD was strongly correlated with the measured CD (R2 = 0.7580, p< 0.0001), though with slightly higher dispersion due to canopy overlap and segmentation uncertainty. The DBH–H relationship (R2 = 0.8337, p< 0.0001) revealed a robust allometric pattern, confirming that H can serve as a reliable predictor of stem size. Collectively, these results demonstrate that UAV LiDAR-derived structural metrics (LH and LCD) can effectively represent field measurements and thus provide a solid foundation for subsequent NLME-based DBH estimation modeling.

Figure 4
Three scatter plots show comparisons of different tree measurements with high correlation values. The top left compares tree height (H) with length (LH), showing a strong linear correlation (R² = 0.9232) with a red regression line, close to the y=x line. The top right compares canopy diameter (CD) with length (LCD), also showing a strong correlation (R² = 0.7580). The bottom plot compares tree height (H) with diameter at breast height (DBH), indicating a strong correlation (R² = 0.8337). Black dots represent data points, with lines showing trend and y=x references.

Figure 4. Relationships between LiDAR-derived and field-measured parameters (LH–H, LCD–CD, and DBH–H) for Catalpa bungei.

In this study, Lidar360 software was used to perform systematic preprocessing of airborne LiDAR point cloud data to ensure the accuracy and reliability of extracted LH and LCD parameters. The preprocessing workflow followed the steps below (Figure 5):

Figure 5
Flowchart illustrating LiDAR point cloud preprocessing. Steps include: (1) Denoising by removing outliers, (2) Ground point classification using gradient and return characteristics, (3) Normalization with a Digital Terrain Model, (4) Individual-tree segmentation using Canopy Height Mode, and (5) Feature extraction determining maximum vertical distance and crown area.

Figure 5. Workflow of LiDAR point-cloud preprocessing and parameter extraction.

The schematic diagram of the raw point cloud data is shown below (Figure 6).

Figure 6
Height map showing trees with color gradations representing different heights. The scale on the left ranges from 0.00 to 11.18, with blue for the lowest and red for the highest points.

Figure 6. Schematic diagram of airborne LiDAR point cloud data in forested plots.

2.3 Predictor variables

In constructing the DBH estimation model based on UAV-LiDAR data,LH and LCD are the most commonly used and stable structural parameters. Studies have shown that LiDAR technology can extract individual tree height with high accuracy (Popescu et al., 2002; Li et al., 2012), and crown width information can also be effectively derived through point cloud-based segmentation methods (Li et al., 2012). Moreover, to avoid model instability and interpretation bias caused by multicollinearity among variables, it is recommended to retain only those variables that are representative and biologically meaningful (Dormann et al., 2013). Therefore, LH and LCD were selected as predictor variables in this study to enhance model stability and generalizability.

2.4 Base model selection

To effectively characterize the nonlinear relationship between DBH and LiDAR-derived variables, four commonly used nonlinear models were selected: logistic, exponential, power, and Richards models (Equations 14). These models are widely applied in tree growth and biomass prediction studies, as they can reflect different growth patterns from juvenile to mature stages (Zeide, 1993). Among them, the logistic and Richards models exhibit typical S-shaped curves, suitable for simulating rapid early growth, accelerated middle-stage growth, and a stabilizing phase later on (Salas-Eljatib et al., 2021). The exponential model is appropriate for describing monotonically increasing growth trends (Salas-Eljatib et al., 2021; Picard et al., 2025), while the power function captures scale-related relationships between variables (Pilli et al., 2006; Salas-Eljatib et al., 2021). Each model has strengths in biological interpretability, flexibility, and adaptability. As such, they were used as candidate models for comparing performance in DBH prediction, and identifying the optimal fitting function. Through preliminary analysis, models with high coefficients of determination (R²), low root mean square errors (RMSE), and low total relative errors (TRE) were selected as the base models (Equations 59). Building upon these, a dummy variable DBH model was developed, followed by the establishment of an NLME DBH model. The formulations of these models are presented as follows:

DBH=a11+b1exp ((c1LH+d1LCD))+ϵ(1)
DBH=a1exp (b1LHc1LCD)+ϵ(2)
DBH=a1LHb1LCDc1+ϵ(3)
DBH=a1(1exp (b1LHc1LCD))+ϵ(4)

where DBH refers to diameter at breast height. Parameters a1, b1, c1, and d1 are coefficients to be estimated. LH denotes tree height extracted from LiDAR data, LCD represents crown width extracted from LiDAR data, and ϵ is the residual error.

The best-performing base model was selected based on the following statistical criteria:

e¯=1Nei=1Ni=1N(DBHiDB^Hi)(5)
σ2=1N1i=1N(eie¯)2(6)
TRE=100×i=1N(DBHiDB^Hi)2i=1N DBHi2(7)
R2=1i=1N(DBHiDB^Hi)2i=1N(DBHiDBH¯)2(8)
RMSE=e¯2+σ2(9)

Where: DBHi represents the ith observed DBH; DB^Hi denotes the ith predicted DBH; DBH¯ is the mean of observed DBH values; ei is the prediction error for the ith sample; and N is the total number of samples. The indicators e¯, σ2, TRE, R2, and RMSE represent the mean error, error variance, total relative error, coefficient of determination, and root mean square error, respectively. RMSE is defined as a combined measure of the average bias and its variance, and is considered one of the most important metrics for evaluating model performance. These indicators provide a comprehensive assessment of the prediction accuracy and reliability of the model.

2.5 Dummy variable model

The dummy variable model is a statistical model widely used for analyzing qualitative or categorical variables. Dummy variables, also known as indicator variables or nominal variables, are primarily used to convert categorical variables into a form suitable for regression analysis, allowing the model to handle non-numeric or unordered categorical data more effectively. In this study, stand density was treated as a qualitative variable, and the dummy variable approach was employed to analyze its influence on the parameters of the DBH growth model.

The dummy variables in this study take values of 0 and 1, assigned as follows:

Pi={1,highdensity planting0,lowdensity planting,i=1

Two planting densities—high and low—were considered. The effect of planting density was incorporated into four base model forms using dummy variables, with the corresponding equations expressed as (Equations 1013).

DBH=aiPi1+(biPi)exp ((ciPiLH+diPiLCD))+ϵ(10)
DBH=(aiPi)exp (biPiLHciPiLCD)+ϵ(11)
DBH=(aiPi)LHbiPiLCDciPi+ϵ(12)
DBH=(aiPi)(1exp (biPiLHciPiLCD))+ϵ(13)

ai, bi, ci, di (i = 1, 2) are parameters related to site conditions in the dummy variable models; Pi denotes the dummy variable, and the other parameters are defined as in the aforementioned base models.

2.6 Nonlinear mixed-effects DBH estimation model

The mixed-effects model is formulated based on a regression function that incorporates both fixed-effect and random-effect parameters (Pinheiro et al., 1999). The general form of the single-level mixed-effects model is shown in Equation 14.

{yij=f(Φi,Xij)+ϵij,i=1,,M, j=1,,ni,Φi=Aiβ+k=1KBi(k)ui(k),ui(k)N(0,ψ(k)), cov(ui(k),ui(l))=0, kl, k=1,,K, l=1,,KϵiN(0, Ri),ϵi=(ϵi1,,ϵini)T(14)

where yij is the jth observation of the ith subject with respect to the dependent variable; M is the number of subjects; ni is the number of observations for the ith subject; f(·) is a real-valued and differentiable nonlinear function that depends on a subject-specific parameter vector Φip and a covariate vector Xijs; βp0 is the fixed-effect vector; ui(k)q(k) is the random-effect vector, assumed to follow a normal distribution with mean zero and variance-covariance matrix ψ(k). For different k and l, if kl, then ui(k) and ui(l) are mutually independent. The residual vector ϵi is also assumed to follow a normal distribution with mean zero and covariance matrix Ri, and is independent of the random effects.

When constructing an NLME model with Catalpa bungei genotype as the random-effect factor, it is usually necessary to determine the optimal random-effect structure using model selection criteria. The most commonly used criteria are the Akaike Information Criterion (AIC) (Equation 15) and the Bayesian Information Criterion (BIC) (Equation 16). AIC emphasizes model fitting accuracy while accounting for model complexity, making it suitable for relatively small datasets. In contrast, BIC imposes a stricter penalty on the number of model parameters and tends to favor simpler model structures, especially in large-sample settings. Since AIC and BIC differ in focus, combining both metrics provides a more comprehensive assessment of the trade-off between model fit and complexity, enabling more robust model selection. The nonlinear mixed-effects model was implemented using the ‘nlme’ package in R version 4.2.

AIC=2kln(L)(15)
BIC=k ln(N)2 ln(L)(16)

where k is the number of model parameters, N is the number of samples, and L is the value of the likelihood function.

2.7 Determination of the variance–covariance Matrix (Ψ) structure among different genotypes

The variance–covariance matrix Ψ for the genotype-level random effects is assumed to be identical for all genotypes and is used to describe the variability in DBH among genotypes. Ψ is assumed to be an unstructured matrix (Calama and Montero, 2005). We assumed the following 2 × 2 variance–covariance matrix:

Ψ=(σ112ρ12ρ21σ222)

where σi2(i, j = 1, 2, i = j) is the variance of the ith random effect, and σij(i,j = 1,2,i  j) is the covariance between the ith and jth random effects.

2.8 Determination of the variance–covariance matrix (Ri) structure within the same genotype

We adopted a general framework for modeling the within-group error covariance structure to account for heteroscedasticity and autocorrelation within the same genotype group (Davidian and Giltinan, 2017), both of which are reflected in Ri (Equation 17) (Davidian and Giltinan, 2017; Meng and Huang, 2009):

Ri=σ2Gi0.5ΓiGi0.5(17)

where σ2 is a dispersion parameter, also referred to as the scaling factor, which equals the residual variance of the model; Gi is an ni×ni diagonal matrix representing heteroscedastic variance within subplot samples; and Γi is an ni×ni matrix representing autocorrelation within the same genotype. In our data, no autocorrelation pattern was observed among observations; thus, Γi was simplified to an ni× ni identity matrix.

We evaluated two commonly used variance-stabilizing functions: the exponential function (Equation 18) and the power function (Equation 19), to address variance heterogeneity (Rombouts et al., 2010). The most effective variance function was then selected using the Likelihood Ratio Test (LRT) and the Akaike Information Criterion (AIC) (Pinheiro et al., 1999; Fang and Bailey, 2001):

var(ϵij)=σ2 exp (2γxij)(18)
var(ϵij)=σ2xij2γ(19)

where xij is the selected predictor variable, LH or LCD, and γ is the parameter to be estimated.

2.9 Model estimation

In this study, all variants of the NLME models were fitted using the maximum likelihood estimation method based on the Lindstrom and Bates (LB) algorithm in R software (version 3.4.2). The LB algorithm has been extensively described in previous studies (Pinheiro et al., 1999; Fang and Bailey, 2001; Lindstrom and Bates, 1990). It estimates model parameters through an iterative optimization process that maximizes the likelihood function of the model given the data. The nlme function in R implements this algorithm, supporting the fitting of complex models and simultaneously handling both fixed and random effects. This method is particularly advantageous for analyzing correlated data due to its ability to account for hierarchical or grouped data structures.

2.10 Subject-specific prediction

The NLME-based DBH estimation model was used to predict DBH values both with and without random effects. This study focused on prediction models that include random effects, also known as subject-specific or localized models. Localized mixed-effects models are also referred to as calibration models (Meng and Huang, 2009). The calibration process requires prior information—in this study, it refers to the measured DBH values of subsample trees. The prediction of random effects was performed using the empirical best linear unbiased prediction (EBLUP) method (Meng and Huang, 2009; Fang and Bailey, 2001).

u^i=ΨZiT(R^i+ZiΨ^ZiT)1ei,=ΨZiT(R^i+ZiΨ^ZiT)1[yif(β^,ui*,xi)+Ziui*](20)

where u^i is the q-dimensional predicted random effect vector for the ith subplot (i=1,,M); ui* is the EBLUP vector of the random effect ui; f(·) is the NLME model for DBH estimation; β^ is theestimated vector of fixed-effect parameters β; xi is the vector of predictor variables; Ψ^ is the estimated variance–covariance matrix of the random effects ui (i=1,,M); R^i is the estimated variance–covariancematrix of the residual ei; and Zi is an ni×q design matrix representing the partial derivative of the nonlinear mixed-effects model f(·) with respect to the random effects ui. Since the unknown random effects appear on both sides of (Equation 20), u^i cannot be solved by direct algebraic methods. To address this, a three-step iterative algorithm based on EBLUP theory was developed to predict random effects (Meng and Huang, 2009), and a computer program in R was subsequently developed to implement this algorithm (Fu et al., 2017).

2.11 Sampling strategies and model calibration evaluation

We used DBH measurements from different numbers of sample trees to predict the random effects in (Equation 20). In general, the more sample trees used in a localized model,the more accurate the prediction (Fu et al., 2017; Xie et al., 2023). Many modeling studies have identified the optimal number of samples required for calibrating NLME models to achieve reliable prediction accuracy.Based on previous studies (Fu et al., 2020), we applied the following four sampling schemes for selecting sample trees from each subplot to capture subject-specific variation in DBH:

1. The 1 to 10 smallest trees per genotype based on DBH.

2. The 1 to 10 largest trees per genotype based on DBH.

3. A random selection of 1 to 10 medium-sized trees per genotype (between the 20th and 80th percentiles of DBH).

4. A random selection of 1 to 10 trees per genotype regardless of size.

We evaluated the prediction performance of each scheme using common statistical metrics, including total relative error (TRE, (Equation 7)), coefficient of determination (R², (Equation 8)), and root mean square error (RMSE, (Equation 9)).

2.12 Model evaluation

We employed a leave-one-out cross-validation approach to assess the performance of the NLME models. In each iteration, the Catalpa sample of one genotype category was selected as the validation set, while the samples from the remaining 77 genotype categories formed the training set. The DBH model was fitted using the training set to estimate model parameters. These estimated parameters were then used to make predictions on the validation set, and prediction statistics—such as the coefficient of determination (R2), root mean square error (RMSE), and total relative error (TRE) [see (Equations 59)]—were calculated. Once one genotype had been validated, it was returned to the dataset, and another genotype was selected as the new validation set. This process was repeated until each genotype had been used once as the validation set.

3 Results

3.1 Parametric regression models

3.1.1 Base models

During model fitting for DBH, the logistic model (RMSE = 1.2495, R2 = 0.8326, TRE = 4.9285) and the power function model (RMSE = 1.3120, R2 = 0.8155, TRE = 5.4605) showed the best accuracy (Table 3). Although the logistic model slightly outperformed the power function model, it had a more complex form, and its residual plots showed a radiating pattern (Figure 7), indicating more severe heteroscedasticity compared to the power function model. Therefore, the power function model was chosen as the base model for developing the dummy variable DBH prediction models.

Table 3
www.frontiersin.org

Table 3. Fitting statistics of the four candidate base models.

Figure 7
Four residual plots compare different models: Logistic, Exponential, Power, and Richards. Each scatter plot shows residuals on the y-axis and fitted DBH on the x-axis, with histograms along the top and right edges for the distribution of fitted values and residuals.

Figure 7. Residual plots of DBH fitted values for the four models and BIC 9150.3254 values were substantially lower, indicating improved predictive accuracy with a more parsimonious model.

3.1.2 Dummy variable models

To improve the convergence rate of the subsequent NLME models, we simplified the structure of the dummy variable models. The specific forms of the three extended equations are shown below.

DBH=(a1+a2·P1)LHb1LCDc1+ϵ(21)
DBH=a1LH(b1+b2·P1)LCDc1+ϵ(22)
DBH=a1LHb1LCD(c1+c2·P1)+ϵ(23)

Among Equations 2123, (Equation 22) demonstrated higher accuracy (Table 4) and was selected as the base form for the NLME model. The power function dummy variable model significantly improved model performance. Compared to the base power model, the power dummy variable model (Equation 22) reduced RMSE by 11.21%, increased R2 by 4.78%, and reduced TRE by 22.09%. Additionally, the model’s AIC (9120.4648).

Table 4
www.frontiersin.org

Table 4. Fit statistics of three different models (AIC, Akaike’s Information Criterion; BIC, Bayesian Information Criterion).

3.1.3 NLME models

Considering that the model included four parameters (a1, b1, b2, c1), the corresponding random effects were u1, u2, u3, and u4. There were 15 possible combinations of random effects in the power dummy variable model. Taking into account the trade-off between model fitting accuracy and the number of parameters, the NLME model with random effects u2 and u3 acting on b1 and b2 was selected as the optimal nonlinear mixed-effects model.

Ultimately, we selected the NLME model in which random effects were assigned to parameters b1 and b2, with AIC = 8980.5630 and BIC = 9028.3400. This model achieved the lowest AIC and BIC values among the candidates while maintaining relatively low model complexity, thereby ensuring successful fitting and validation of the NLME model.

The fitting statistics of the NLME model (b1 + b2) were: RMSE = 1.0837, R2 = 0.8741, TRE = 3.6623. Compared to the power dummy variable model (Equation 22), the NLME model (b1 +b2) reduced RMSE by 6.96%, increased R2 by 2.29%, and reduced TRE by 13.93%. Additionally, AIC = 8980.5630 and BIC = 9028.3400 were also improved.

Most parameter estimates were statistically significant (p< 0.05). Even with the introduction of random effects, heteroscedasticity was still present in the DBH estimation model (Equation 24). The final NLME DBH model is as follows:

DBH=a1LH(b1+μ2+(b2+μ3)·P1)LCDc1+ϵ(24)

where DBH denotes the diameter at breast height of Catalpa trees; a1, b1, b2, and c1 are fixed-effect parameters; LHand LCDrepresent LiDAR-derived tree height and crown diameter, respectively; u2 is the random effect on b1 due to genotype variation; u3 is the random effect on b2 due to genotype variation; P1 is a dummy variable; and ϵ is the residual error.

3.2 Parameter estimation

We evaluated two types of variance-stabilizing functions. Among them, the power variance function using LH as the predictor variable performed best. Although the residual plots did not visually show a clear improvement after introducing the power variance function—possibly due to the large sample size and dense distribution of residual points, which may obscure local structure variations—the AIC value of the model decreased significantly (from 8980.56 to 8384.73), indicating that model fitting improved substantially. Therefore, considering both model evaluation metrics and the rationality of the error structure, the final model retained the power-type variance structure to enhance the robustness of parameter estimation and the overall interpretability of the model. This structure was applied to (Equation 24). After substituting the estimated values of the fixed-effect parameters into (Equation 24), the final NLME DBH estimation model becomes:

DBH=0.84·LH(1.183+u2+(0.1427+u3)·P1)·LCD0.2726+ϵ(25)

where:

ui=[u2u3]N([00],Ψ1=(7.763E034.857E034.857E035.401E03)),
ϵi=(ϵi1,,ϵini)TN (0, Ri=0.1212·Gi0.5ΓiGi0.5),
Gi=diag(LHi11.683,,LHini1.683),
Γi=Ini,

The matrix Iniis an ni× niidentity matrix. All other parameters and predictor variables in this model are defined as previously described.The estimated parameters for (Equation 25) are listed in Table 5.

Table 5
www.frontiersin.org

Table 5. Parameter estimates and fit statistics of two different models (AIC, Akaike’s information criterion; and LL, log-likelihood).

3.3 Subject-specific DBH prediction

The medium and random strategies demonstrated the most stable and consistent performance, as shown in Figure 8. Both showed smooth and continuous improvement in all metrics, and their curves were almost identical, suggesting that these two approaches maintained high accuracy and good generalization ability across different sample sizes. Among them, the random strategy achieved comparable accuracy to the medium strategy but offered greater practical advantages due to its simplicity and flexibility in implementation.

Figure 8
Three line graphs display data against sample numbers one to ten with different metrics: the first graph shows root mean square error, the second displays R-squared, and the third shows total relative error for four categories: biggest, smallest, random, and medium. Each category has distinct line styles and colors. The biggest line shows more variation, while the smallest line tends to remain more stable across graphs. The background has a gradient from pink to blue.

Figure 8. RMSE, R², and TRE of (Equation 25) calibrated using four sampling strategies. Sampling strategies correspond to 1), 2), 3), and 4) in Section 2.11.

It is particularly noteworthy that when the number of randomly sampled trees increased from one to two, the model accuracy showed a noticeable improvement, which was relatively more pronounced compared with the other cases: RMSE decreased to some extent, R² increased, and TRE exhibited a further declining trend. This indicates that under the random sampling strategy, even a very small calibration sample can effectively enhance model performance, while additional samples yield diminishing returns.

In conclusion, the random sampling strategy is recommended, with two sampled trees serving as the optimal calibration configuration. This approach ensures both model stability and prediction accuracy while significantly reducing sampling and computational costs, making it the most practical and scientifically sound choice.

3.4 Model evaluation

All statistical models developed in this study—including the base model, dummy variable model, and NLME model—were evaluated using an independent dataset. During evaluation, prediction accuracy was quantified using methods defined by (Equation 5) through (Equation 9), producing key statistical metrics including the coefficient of determination (R2), root mean square error (RMSE), and total relative error (TRE). These indicators provided important reference points for assessing model performance.

All models were evaluated on independent datasets using three statistical metrics (RMSE, R², and TRE). During model fitting, (Equation 25) achieved = 0.8624 (5.75% higher than the base power model), RMSE = 1.1330 (13.63% lower), and TRE = 3.9556 (27.60% lower), indicating that the nonlinear mixed-effects model outperformed traditional parametric models in fitting performance.

Furthermore, to assess model robustness, a leave-one-out cross-validation was conducted. Under the random sampling strategy with two sampled trees, the results were = 0.8392, RMSE = 1.1444, and TRE = 4.4238, demonstrating that the calibrated model maintained high predictive accuracy and generalization capability during independent validation.

4 Discussion

This study developed a NLME model for predicting the DBH of Catalpa bungei using UAV LiDAR data and genotype information.

4.1 Methodological advances in DBH modeling

Remote sensing technologies, particularly LiDAR, have substantially advanced forest structural assessment by capturing three-dimensional canopy and tree attributes. Traditional DBH models rely on H and CD as key predictors, but often assume population homogeneity. While effective in some contexts, such models overlook genetic variability and management-induced differences that influence tree growth.

The present study introduced genotype as a random effect into the NLME framework, enabling the model to capture intraspecific variation that conventional approaches ignore. Compared with previous LiDAR-based DBH models that incorporated only plot-level or topographic effects (Fu et al., 2020; Yang et al., 2020), our model provides a more refined characterization of growth variability. This innovation allowed us to better explain differences among individuals growing under identical site conditions but with distinct genetic backgrounds. The resulting improvement in predictive performance demonstrates the value of embedding genetic heterogeneity in forest structural modeling.

Beyond genotype, the model structure integrated LiDAR-derived variables (H and CD) with a planting density dummy variable to reflect management effects. This combination enhanced explanatory power and improved generalization. While prior studies have emphasized structural predictors or error-in-variable frameworks, they often remained within fixed-effects models (Fu et al., 2018). By contrast, the NLME approach adopted here not only accommodates structural and management variables but also provides a robust statistical basis for incorporating biological diversity. This methodological flexibility suggests broad applicability across species and ecosystems where genetic and management factors shape tree allometry.

4.2 Challenges and perspectives in LiDAR–field data alignment

Although the NLME model demonstrated superior performance under calibration conditions, this advantage largely depends on the high-precision alignment between LiDAR data and field measurements. In practical applications, accurately matching LiDAR-derived tree structural parameters with field-measured DBH remains challenging, especially in degraded stands or areas with complex and overlapping canopy structures. Crown occlusion, sparse point density, and incomplete returns often lead to mismatches between individual trees, thereby affecting model calibration accuracy.

To minimize such errors, future studies should aim to maximize the precision of LiDAR data processing during acquisition and matching, and integrate high-accuracy GNSS positioning systems to assist in spatial registration of individual trees. In addition, several strategies can be considered:

i. using high-density UAV LiDAR data combined with accurately measured ground control points for geometric correction;

ii. recording tree coordinates or stem positions simultaneously during field measurements;

iii. applying robust tree-matching algorithms based on multi-attribute similarity (e.g., H, CD, spatial proximity);

iv. adopting probabilistic or fuzzy matching methods to address uncertain correspondences.

These measures can help ensure the spatial consistency and reliability of calibration samples, thereby further improving the predictive accuracy and practical applicability of NLME-based models.

4.3 Comparison with existing studies

Previous studies have shown that NLME models can effectively capture forest structural or spatial heterogeneity by incorporating plots or sites as random effects (Fu et al., 2020; Zhang et al., 2021; Zhou et al., 2023). However, these approaches generally assume population homogeneity and fail to adequately account for the influence of intra-specific genetic variation on tree allometric relationships. In contrast, this study introduces genotype as a random effect within the NLME framework, extending mixed-effects modeling from the environmental to the genetic level. This innovation not only significantly improved model accuracy (R2 = 0.8624, RMSE = 1.1330, TRE = 3.9556) but also enhanced the biological interpretability and ecological applicability of the model by linking LiDAR-derived structural parameters with genotypic growth variation.

Moreover, unlike the calibration strategies adopted by (Fu et al., 2020; Zhou et al., 2023), which relied on selecting the largest or smallest individuals, our findings demonstrate that random sampling can achieve stable and highly accurate predictions with minimal field effort. This result stems from the hierarchical structure of the NLME model and the pronounced genetic heterogeneity within the stand. Incorporating genotype as a random effect enables the model to absorb systematic variation among individuals, making randomly selected samples more representative of overall growth variability. In contrast, extreme sampling (e.g., selecting only the largest or smallest trees) may bias calibration toward specific genotypes or microsite conditions, thereby reducing model generalization. Overall, random sampling achieves an optimal balance between representativeness, accuracy, and field efficiency, underscoring the advantages of genotypebased random-effect modeling for accurate and scalable DBH estimation in genetically diverse forest stands.

4.4 Ecological implications of accurate DBH estimation

DBH is widely recognized as the most reliable measurable tree attribute for estimating aboveground biomass (AGB). Errors in DBH estimation propagate into substantial uncertainties in biomass and carbon stock assessments at stand or regional scales. Small deviations can lead to significant over- or underestimation of carbon sequestration potential, undermining both scientific assessments and climate policy mechanisms such as carbon accounting and trading. By improving DBH prediction accuracy, our genotype-informed NLME model contributes directly to more reliable carbon stock estimation and climate change mitigation strategies.

In addition to biomass, DBH is often used as a proxy for biodiversity- and habitat-related indicators. Larger trees typically support greater structural complexity and offer habitat niches for numerous organisms. Accurate estimation of DBH distributions therefore improves our ability to evaluate forest structural diversity, habitat quality, and ecosystem services. By reducing prediction uncertainties, the model strengthens the reliability of ecological indicators that reflect forest health and biodiversity.

The explicit incorporation of genotype is particularly relevant for ecological monitoring. Forest indicators derived from remote sensing often reflect spatial variability, but genetic variability is rarely considered. Our findings show that genetic diversity exerts measurable effects on tree structure, suggesting that ignoring it may underestimate variability in forest productivity and resilience. Genotype-sensitive modeling thus provides a pathway toward integrating biological diversity into remote sensing indicators, a step that can enrich ecological monitoring frameworks and improve our understanding of forest adaptive capacity under environmental change.

4.5 Practical considerations for monitoring and sampling

A key component of this study was the evaluation of calibration strategies for mixed-effects models. Four sampling approaches were tested, and random sampling consistently emerged as the most efficient, maintaining accuracy while minimizing fieldwork. Notably, even minimal random sampling (two trees) significantly improved model performance, highlighting the practicality of this approach for large-scale surveys.

This insight has strong operational implications. UAV LiDAR surveys increasingly cover large and remote areas where extensive ground measurements are impractical. By adopting efficient calibration strategies, forest monitoring programs can optimize resource allocation while ensuring data quality. The proposed methodology thus offers a practical template for combining remote sensing with limited ground calibration, applicable to genetically diverse plantations as well as heterogeneous natural forests.

4.6 Transferability, limitations, and future directions

Although this study focused on Catalpa bungei, the proposed methodological framework is broadly transferable. The flexibility of the NLME model allows incorporation of different random effects, such as species identity, site conditions, or management regimes, making it adaptable across diverse forest ecosystems.

While the proposed genotype-informed NLME framework performed exceptionally well in Catalpa bungei plantations, its broader applicability across different tree species and forest types warrants further investigation. Tree species with contrasting growth architectures, diverse reproductive strategies, or more complex interspecific competition may exhibit distinct genotype–phenotype interactions that alter allometric scaling patterns. Extending this framework to natural or seed-propagated forests could therefore reveal new insights into how genetic diversity interacts with environmental heterogeneity to shape individual tree growth. Future studies should explore these dynamics by incorporating additional physiological, climatic, and soil factors, thereby testing the robustness, adaptability, and ecological generality of the proposed modeling approach.

Nevertheless, we acknowledge several limitations. First, while genotype was explicitly included, further validation across multiple species and regions is needed to assess generalizability. Second, only two structural variables (H and CD) were derived from UAV LiDAR; incorporating additional metrics such as crown volume, canopy density, or stand-level heterogeneity may further improve performance. Third, environmental covariates such as soil type, water availability, and microclimate were not included but could be integrated in future studies to capture broader ecological variability. Finally, uncertainties in this study may arise from both observational and sampling sources. Field-measured DBH and LiDARderived structural variables are inevitably affected by positional mismatches, sensor noise, and scaling inconsistencies, which can propagate into model predictions (Richardson et al., 2008). Moreover, variations in sampling intensity and the representativeness of field plots may introduce additional uncertainty, particularly in genetically diverse or structurally heterogeneous stands. Similar to the challenges faced in long-term meteorological monitoring, where gap-filling and interpolation can lead to systematic biases (Shen et al., 2017), minor inconsistencies between field and LiDAR observations may distort calibration accuracy. Although rigorous data preprocessing, filtering, and model fitting were implemented to minimize these effects, the influence of measurement and sampling uncertainty cannot be entirely eliminated. Future work should therefore emphasize enhanced field–LiDAR co-registration accuracy and uncertainty quantification to further improve model robustness and transferability.

Future research should thus focus on expanding the model’s scope across species, ecosystems, and geographic regions. Integrating more detailed LiDAR-derived indicators and environmental data would not only enhance model accuracy but also allow exploration of genotype–environment interactions. Moreover, linking DBH predictions with higher-level ecological processes (e.g., carbon dynamics, resilience under climate extremes) could provide even more powerful insights for ecological monitoring and indicator development.

4.7 Contributions to ecological indicators and monitoring

By bridging UAV LiDAR data, advanced statistical modeling, and genetic information, this study contributes to the development of ecological indicators in several ways. First, it strengthens the accuracy of DBH estimation, which underlies key indicators of biomass, carbon stocks, and forest productivity. Second, it demonstrates that incorporating genetic diversity improves the ecological relevance of structural indicators, aligning them more closely with biodiversity and resilience assessments. Third, it provides practical guidance for sampling and calibration in UAV-based monitoring, thereby enhancing the efficiency and scalability of ecological surveys.

Together, these contributions advance the role of remote sensing in ecological monitoring, bridging the gap between individual-level measurements and ecosystem-level assessments.

5 Conclusion

This study developed a NLME model for predicting the DBH of Catalpa bungei by integrating UAV LiDARderived structural variables with genotype information. DBH serves as a key indicator of tree growth, biomass, and stand structure, and its accurate estimation is essential for evaluating forest productivity, ecosystem health, and carbon storage.

Incorporating genotype as a random effect markedly improved model accuracy and reduced uncertainty, emphasizing the importance of genetic diversity in shaping tree allometry and growth variability. LH and LCD effectively represented structural attributes related to DBH, enabling robust predictions across different stand conditions. Random sampling proved to be the most efficient calibration strategy, achieving high accuracy with minimal fieldwork.

Overall, the genotype-sensitive NLME framework provides a scalable and ecologically meaningful approach to improving DBH estimation, supporting precise forest inventory, carbon accounting, and sustainable management of genetically diverse forest ecosystems.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author contributions

YZ: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Validation, Visualization, Writing – original draft. MZ: Conceptualization, Data curation, Funding acquisition, Investigation, Project administration, Validation, Visualization, Writing – original draft. QC: Conceptualization, Investigation, Methodology, Funding acquisition, Project administration, Resources, Supervision, Writing – review & editing. LF: Data curation, Formal Analysis, Writing – original draft. WM: Methodology, Visualization, Writing – original draft. GD: Data curation, Investigation, Resources, Writing – original draft. XF: Formal Analysis, Validation, Visualization, Writing – original draft. ZZ: Data curation, Writing – original draft. CW: Investigation, Resources, Writing – original draft. QW: Investigation, Resources, Writing – original draft. YS: Investigation, Resources, Writing – original draft. PL: Visualization, Writing – original draft.

Funding

The author(s) declared that financial support was received for work and/or its publication. This work was supported by the Fundamental Research Funds for the Central Non-profit Research Institution of CAF under Grant Nos. CAFYBB2022ZB002 and CAFYBB2020GC006.

Acknowledgments

This work is supported by the Fundamental Research Funds for the Central Non-profit Research Institution of CAF (CAFYBB2022ZB002, CAFYBB2020GC006). We appreciate the comments and suggestions given by all reviewers of this study.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

Balestra, M., Marselis, S., Sankey, T. T., Cabo, C., Liang, X., Mokroš, M., et al. (2024). enLiDAR data fusion to improve forest attribute estimates: A review. Curr. Forestry Rep. 10, 281–297. doi: 10.1007/s40725-024-00223-7

Crossref Full Text | Google Scholar

Borsah, A. A., Nazeer, M., and Wong, M. S. (2023). enLIDAR-based forest biomass remote sensing: A review of metrics, methods, and assessment criteria for the selection of allometric equations. Forests. 14, 2095. doi: 10.3390/f14102095

Crossref Full Text | Google Scholar

Bronisz, K. and Mehtätalo, L. (2020). enMixed-effects generalized height–diameter model for young silver birch stands on post-agricultural lands. For. Ecol. Manage. 460, 117901. doi: 10.1016/j.foreco.2020.117901

Crossref Full Text | Google Scholar

Calama, R. and Montero, G. (2005). Multilevel linear mixed model for tree diameter increment in stone pine (Pinus pinea): a calibrating approach. Silva Fennica. 39, 37–54. doi: 10.14214/sf.394

Crossref Full Text | Google Scholar

Dalponte, M., Frizzera, L., Ørka, H. O., Gobakken, T., Næsset, E., and Gianelle, D. (2018). enPredicting stem diameters and aboveground biomass of individual trees using remote sensing data. Ecol. Indic. 85, 367–376. doi: 10.1016/j.ecolind.2017.10.066

Crossref Full Text | Google Scholar

Davidian, M. and Giltinan, D. M. (2017). enNonlinear Models for Repeated Measurement Data. 1 edn (New York, NY: Routledge). doi: 10.1201/9780203745502

Crossref Full Text | Google Scholar

De Conto, T., Armston, J., and Dubayah, R. (2024). enCharacterizing the structural complexity of the Earth’s forests with spaceborne lidar. Nat. Commun. 15, 8116. doi: 10.1038/s41467-024-52468-2

PubMed Abstract | PubMed Abstract | Crossref Full Text | Google Scholar

Dormann, C. F., Elith, J., Bacher, S., Buchmann, C., Carl, G., Carré, G., et al. (2013). enCollinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography. 36, 27–46. doi: 10.1111/j.1600-0587.2012.07348.x

Crossref Full Text | Google Scholar

Fang, Z. and Bailey, R. L. (2001). enNonlinear mixed effects modeling for slash pine dominant height growth following intensive silvicultural treatments. For. Sci. 47, 287–300. doi: 10.1093/forestscience/47.3.287

Crossref Full Text | Google Scholar

Fu, L., Duan, G., Ye, Q., Meng, X., Luo, P., Sharma, R. P., et al. (2020). enPrediction of individual tree diameter using a nonlinear mixed-effects modeling approach and airborne liDAR data. Remote Sens. 12, 1066. doi: 10.3390/rs12071066

Crossref Full Text | Google Scholar

Fu, L., Liu, Q., Sun, H., Wang, Q., Li, Z., Chen, E., et al. (2018). enDevelopment of a system of compatible individual tree diameter and aboveground biomass prediction models using error-in-variable regression and airborne liDAR data. Remote Sens. 10, 325. doi: 10.3390/rs10020325

Crossref Full Text | Google Scholar

Fu, L., Zhang, H., Sharma, R. P., Pang, L., and Wang, G. (2017). enA generalized nonlinear mixed-effects height to crown base model for Mongolian oak in northeast China. For. Ecol. Manage. 384, 34–43. doi: 10.1016/j.foreco.2016.09.012

Crossref Full Text | Google Scholar

He, S., Osman, Z., Cladera, F., Ong, D., Rai, N., Green, P. C., et al. (2025). Estimating the diameter at breast height of trees in a forest with a single 360 camera. doi: 10.48550/arXiv.2505.03093

Crossref Full Text | Google Scholar

Ismail, M. J., Poudel, T. R., Ali, A., and Dong, L. (2025). Incorporating stand parameters in nonlinear height-diameter mixed-effects model for uneven-aged Larix gmelinii forests. Front. Forests Global Change. 7. doi: 10.3389/ffgc.2024.1491648

Crossref Full Text | Google Scholar

King, D. A. (2005). enLinking tree form, allocation and growth with an allometrically explicit model. Ecol. Model. 185, 77–91. doi: 10.1016/j.ecolmodel.2004.11.017

Crossref Full Text | Google Scholar

Li, G., Harrison, S. P., Prentice, I. C., and Falster, D. (2014). enSimulation of tree-ring widths with a model for primary production, carbon allocation, and growth. Biogeosciences. 11, 6711–6724. doi: 10.5194/bg-11-6711-2014

Crossref Full Text | Google Scholar

Li, W., Guo, Q., Jakubowski, M. K., and Kelly, M. (2012). enA new method for segmenting individual trees from the lidar point cloud. Photogrammetric Eng. Remote Sens. 78, 75–84. doi: 10.14358/PERS.78.1.75

Crossref Full Text | Google Scholar

Lindstrom, M. J. and Bates, D. M. (1990). Nonlinear mixed effects models for repeated measures data. Biometrics. 46, 673. doi: 10.2307/2532087

PubMed Abstract | PubMed Abstract | Crossref Full Text | Google Scholar

Liu, Q., Fu, L., Wang, G., Li, S., Li, Z., Chen, E., et al. (2020). Improving estimation of forest canopy cover by introducing loss ratio of laser pulses using airborne liDAR. IEEE Trans. Geosci. Remote Sens. 58, 567–585. doi: 10.1109/TGRS.2019.2938017

Crossref Full Text | Google Scholar

Mencuccini, M. (2003). enThe ecological significance of long-distance water transport: short-term regulation, long-term acclimation and the hydraulic costs of stature across plant life forms. Plant Cell Environ. 26, 163–182. doi: 10.1046/j.1365-3040.2003.00991.x

Crossref Full Text | Google Scholar

Meng, S. X. and Huang, S. (2009). enImproved calibration of nonlinear mixed-effects models demonstrated on a height growth function. For. Sci. 55, 238–248. doi: 10.1093/forestscience/55.3.238

Crossref Full Text | Google Scholar

Oehmcke, S., Li, L., Revenga, J. C., Nord-Larsen, T., Trepekli, K., Gieseke, F., et al. (2022). “enDeep learning based 3D point cloud regression for estimating forest biomass,” in Proceedings of the 30th International Conference on Advances in Geographic Information Systems. (ACM, Seattle Washington), 1–4. doi: 10.1145/3557915.3561471

Crossref Full Text | Google Scholar

Ouyang, C., He, X., Lin, R., and Qin, K. (2024). enExploring the correlation between tree structure characteristics and carbon storage in historic gardens using TLS technology: a case study of Jian Xin Pavilions at Jingyi Park, Fragrant Hills Park. Heritage Sci. 12, 380. doi: 10.1186/s40494-024-01500-1

Crossref Full Text | Google Scholar

Picard, N., Fonton, N., Boyemba Bosela, F., Fayolle, A., Loumeto, J., Ngua Ayecaba, G., et al. (2025). enSelecting allometric equations to estimate forest biomass from plot- rather than individual-level predictive performance. Biogeosciences. 22, 1413–1426. doi: 10.5194/bg-22-1413-2025

Crossref Full Text | Google Scholar

Pilli, R., Anfodillo, T., and Carrer, M. (2006). enTowards a functional and simplified allometry for estimating forest biomass. For. Ecol. Manage. 237, 583–593. doi: 10.1016/j.foreco.2006.10.004

Crossref Full Text | Google Scholar

Pinheiro, J., Bates, D., and R Core Team (1999). ennlme: Linear and Nonlinear Mixed Effects Models. (New York, NY: Springer). doi: 10.32614/CRAN.package.nlme

Crossref Full Text | Google Scholar

Popescu, S. C., Wynne, R. H., and Nelson, R. F. (2002). enEstimating plot-level tree heights with lidar: local filtering with a canopy-height based variable window size. Comput. Electron. Agric. 37, 71–95. doi: 10.1016/S0168-1699(02)00121-7

Crossref Full Text | Google Scholar

Qin, Y., He, X., Guo, H., Zhou, C., Zeng, W., Chen, X., et al. (2025). enNational growth models for stand basal area, volume, and biomass in Chinese larch plantations: integrating stand structure and species effects. Trees Forests People. 21, 100943. doi: 10.1016/j.tfp.2025.100943

Crossref Full Text | Google Scholar

Raptis, D., Kazana, V., Kazaklis, A., and Stamatiou, C. (2018). enA crown width-diameter model for natural even-aged black pine forest management. Forests. 9, 610. doi: 10.3390/f9100610

Crossref Full Text | Google Scholar

Richardson, A. D., Mahecha, M. D., Falge, E., Kattge, J., Moffat, A. M., Papale, D., et al. (2008). enStatistical properties of random CO2 flux measurement uncertainty inferred from model residuals. Agric. For. Meteorology. 148, 38–50. doi: 10.1016/j.agrformet.2007.09.001

Crossref Full Text | Google Scholar

Rombouts, J., Ferguson, I. S., and Leech, J. W. (2010). enCampaign and site effects in LiDAR prediction models for site-quality assessment of radiata pine plantations in South Australia. Int. J. Remote Sens. 31, 1155–1173. doi: 10.1080/01431160903380573

Crossref Full Text | Google Scholar

Ryan, M. G. and Yoder, B. J. (1997). Hydraulic limits to tree height and tree growth. BioScience. 47, 235–242. doi: 10.2307/1313077

Crossref Full Text | Google Scholar

Salas-Eljatib, C., Mehtätalo, L., Gregoire, T. G., Soto, D. P., and Vargas-Gaete, R. (2021). enGrowth equations in forest research: mathematical basis and model similarities. Curr. Forestry Rep. 7, 230–244. doi: 10.1007/s40725-021-00145-8

Crossref Full Text | Google Scholar

Shen, X., Liu, B., Lu, X., and Fan, G. (2017). enSpatial and temporal changes in daily temperature extremes in China during 1960–2011. Theor. Appl. Climatology. 130, 933–943. doi: 10.1007/s00704-016-1934-3

Crossref Full Text | Google Scholar

Shi, X., Zhao, J., Wang, Y., Wu, G., Hou, Y., and Yu, C. (2025). enOptimized maxEnt modeling of catalpa bungei habitat for sustainable management under climate change in China. Forests. 16, 1150. doi: 10.3390/f16071150

Crossref Full Text | Google Scholar

Wang, P., Ma, L., Li, Y., Wang, S., Li, L., Yang, R., et al. (2016). enTranscriptome profiling of indole-3-butyric acid-induced adventitious root formation in softwood cuttings of the Catalpa bungei variety ‘YU-1’ at different developmental stages. Genes Genomics. 38, 145–162. doi: 10.1007/s13258-015-0352-8

Crossref Full Text | Google Scholar

Wang, Y., Wang, D., Zhang, D., Liu, Q., and Li, Y. (2024). enPredicting carbon storage of mixed broadleaf forests based on the finite mixture model incorporating stand factors, site quality, and aridity index. For. Ecosyst. 11, 100189. doi: 10.1016/j.fecs.2024.100189

Crossref Full Text | Google Scholar

Xiang, B., Wielgosz, M., Kontogianni, T., Peters, T., Puliti, S., Astrup, R., et al. (2024). enAutomated forest inventory: Analysis of high-density airborne LiDAR point clouds with 3D deep learning. Remote Sens. Environ. 305, 114078. doi: 10.1016/j.rse.2024.114078

Crossref Full Text | Google Scholar

Xiao, Y., Ma, W., Lu, N., Wang, Z., Wang, N., Zhai, W., et al. (2019). enGenetic Variation of Growth Traits and Genotype-by-Environment Interactions in Clones of Catalpa bungei and Catalpa fargesii f. duclouxii. Forests. 10, 57. doi: 10.3390/f10010057

Crossref Full Text | Google Scholar

Xiao, Y., Wang, Z., Wang, J., Yun, H., Ling, J., Zhai, W., et al. (2025). Growth adaptability and stability in Catalpa bungei clones: the role of genetics and environment. Forestry Res. 5, 0–0. doi: 10.48130/forres-0025-0003

PubMed Abstract | PubMed Abstract | Crossref Full Text | Google Scholar

Xie, D., Huang, H., Feng, L., Sharma, R. P., Chen, Q., Liu, Q., et al. (2023). enAboveground biomass prediction of arid shrub-dominated community based on airborne liDAR through parametric and nonparametric methods. Remote Sens. 15, 3344. doi: 10.3390/rs15133344

Crossref Full Text | Google Scholar

Xu, Q., Yang, F., Hu, S., He, X., and Hong, Y. (2024). enTree height–diameter model of natural coniferous and broad-leaved mixed forests based on random forest method and nonlinear mixed-effects method in Jilin Province, China. Forests. 15, 1922. doi: 10.3390/f15111922

Crossref Full Text | Google Scholar

Yang, Z., Liu, Q., Luo, P., Ye, Q., Duan, G., Sharma, R. P., et al. (2020). enPrediction of individual tree diameter and height to crown base using nonlinear simultaneous regression and airborne liDAR data. Remote Sens. 12, 2238. doi: 10.3390/rs12142238

Crossref Full Text | Google Scholar

Zeide, B. (1993). enAnalysis of growth equations. For. Sci. 39, 594–616. doi: 10.1093/forestscience/39.3.594

Crossref Full Text | Google Scholar

Zhang, H., Feng, Z., Shen, C., Li, Y., Feng, Z., Zeng, W., et al. (2022). enRelationship between the geographical environment and the forest carbon sink capacity in China based on an individual-tree growth-rate model. Ecol. Indic. 138, 108814. doi: 10.1016/j.ecolind.2022.108814

Crossref Full Text | Google Scholar

Zhang, X., Fu, L., Sharma, R. P., He, X., Zhang, H., Feng, L., et al. (2021). enA nonlinear mixed-effects height-diameter model with interaction effects of stand density and site index for larix olgensis in Northeast China. Forests. 12, 1460. doi: 10.3390/f12111460

Crossref Full Text | Google Scholar

Zhao, Y., Feng, Y., Yang, C., Qiao, J., Zhou, H., Li, F., et al. (2022). enGenetic parameters and genotype–environment interactions in paulownia clonal tests in temperate and subtropical regions of China. Forests. 13, 2113. doi: 10.3390/f13122113

Crossref Full Text | Google Scholar

Zhou, X., Li, Z., Liu, L., Sharma, R. P., Guan, F., and Fan, S. (2023). Constructing two-level nonlinear mixed-effects crown width models for Moso bamboo in China. Front. Plant Sci. 14. doi: 10.3389/fpls.2023.1139448

PubMed Abstract | PubMed Abstract | Crossref Full Text | Google Scholar

Zhou, L., Meng, R., Tan, Y., Lv, Z., Zhao, Y., Xu, B., et al. (2022). enComparison of UAV-based LiDAR and digital aerial photogrammetry for measuring crown-level canopy height in the urban environment. Urban Forestry Urban Greening. 69, 127489. doi: 10.1016/j.ufug.2022.127489

Crossref Full Text | Google Scholar

Zhu, X., Huang, T., Liu, Z., Bai, L., Yang, Y., Ye, J., et al. (2025). Developing a generalized nonlinear mixed-effects biomass model at stand-level under different age conditions for Chinese fir based on LiDAR and ground survey data in southern China. Front. Plant Sci. 16. doi: 10.3389/fpls.2025.1532138

PubMed Abstract | PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: UAV lidar, DBH prediction, nonlinear mixed-effects model (NLME), genotype differences, Catalpa bungei

Citation: Zhang Y, Zhang M, Chen Q, Fu L, Ma W, Duan G, Fu X, Zheng Z, Wu C, Wang Q, Shun Y and Li P (2025) Predicting individual tree diameter at breast height for genetically diverse Catalpa bungei using nonlinear mixed-effects models and UAV LiDAR data. Front. Plant Sci. 16:1716546. doi: 10.3389/fpls.2025.1716546

Received: 30 September 2025; Accepted: 25 November 2025; Revised: 21 November 2025;
Published: 19 December 2025.

Edited by:

Yalin Wu, Shenzhen Hospital, Peking University, China

Reviewed by:

Xiangjin Shen, Chinese Academy of Sciences (CAS), China
Pei He, Northeast Forestry University, China

Copyright © 2025 Zhang, Zhang, Chen, Fu, Ma, Duan, Fu, Zheng, Wu, Wang, Shun and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Qiao Chen, Y2hlbmdxaXFvQDE2My5jb20=

These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.