Improving parameters estimation of a truncated Poisson regression model based on meta-heuristic optimization algorithms

Basheer, Ghalya Tawfeeq; Waleed Mahmood, Shaimaa; Algamal, Zakariya Yahya

doi:10.3389/fams.2026.1744058

ORIGINAL RESEARCH article

Front. Appl. Math. Stat., 04 February 2026

Sec. Statistics and Probability

Volume 12 - 2026 | https://doi.org/10.3389/fams.2026.1744058

This article is part of the Research TopicNew Frontiers in the Application of Mathematics to Biological SciencesView all 3 articles

Improving parameters estimation of a truncated Poisson regression model based on meta-heuristic optimization algorithms

Ghalya Tawfeeq Basheer¹

Shaimaa Waleed Mahmood²

Zakariya Yahya Algamal²^*^†

¹Department of Operations Research and Intelligent Techniques, University of Mosul, Mosul, Iraq
²Department of Statistics and Informatics, University of Mosul, Mosul, Iraq

The paper discusses computational and numerical challenges that are associated with the truncation of the information and which change the usual Poisson likelihood by the introduction of black kite optimization algorithm. Real data is used to demonstrate a significant improvement in healthcare and medical research. Truncated Poisson regression models (TPRM) are essential for analyzing count data where zero counts are unobserved, a common scenario in many real-world applications, such as healthcare and medical research. However, parameter estimation in such models often suffers from bias and inefficiency due to the complexity induced by truncation. This study proposes an improved parameter estimation approach for TPRM by leveraging meta-heuristic optimization algorithms. Specifically, we integrate state-of-the-art meta-heuristics, black kite optimization algorithm (BKA), to optimize the likelihood function and overcome the limitations of traditional iterative methods such as Newton-Raphson and quasi-Newton algorithms. Using extensive Monte Carlo simulations and real data application, we evaluate the performance of the proposed method under varying sample sizes and covariate structures. The results demonstrate that our meta-heuristic-based estimator significantly reduces mean squared error compared to conventional estimators, enhancing model reliability and predictive accuracy. The proposed approach offers a robust and efficient alternative for parameter estimation in truncated Poisson regression, with potential applications in epidemiology, ecology, and other fields dealing with truncated count data.

1 Introduction

Count data modeling is a statistical field of study directed at such variables as dependent variables that are identifications of a non-negative integer number of events or occurrences, i.e., the quantity of visits to a doctor, traffic incidents, or children within a family [1, 2]. Leading to the right skewed distribution, among other characteristics unlike continuous outcomes of count data are discrete that tend to have a distorted distribution centered on lower values with a high proportion of zeros [3].

Poisson regression, a classical modeling strategy of the count data, presupposes the idea of the counts being distributed according to Poisson distribution with its means (equidispersion). This model presents the expected count as an exponential of explanatory variables that makes its interpretation to be simple in the generalized linear model framework. Empirical data however, generally do not fit the equidispersion assumption, therefore assuming they are over dispersed, which is very common, or under dispersed, which is rare. Such cases are dealt with by substituting such models with newer models with more parameters such as the Negative Binomial regression that has an extra parameter to fit flexibility to variance independent of the mean. Other distributions intended to handle different dispersion are the Generalized Poisson Conway-Maxwell-Poisson (COM-Poisson) and similar distributions [4, 5].

The other difficulty which is commonly seen is the occurrence of excess zeros; that is more counts than are predicted by standard count models. At the cost of this, zero-inflated models and hurdle models have been designed. The two components have independent structure each component modeling the result on the binary zero-non zero and the count distribution of the positive values, and thus generates better fitting and inference in situations where zeros are produced by separate mechanisms [6, 7].

The truncated Poisson regression model is a statistical technique used to analyze a count-type of data that are truncated; such as dynamic observations are excluded in the sample when smaller or larger than some integer limits [8, 9]. Truncated Poisson regression contrasts with the usual Poisson regression model in that the counts take a truncated distribution due to the requirement that the counts exceed (left truncation) or are less than (right truncation) a given value [10]. This method would consider the fact that the realized counts do not fall into the grand scope of counts that might be observed hence avoid bias in parameter estimation and wrong inference that would have been made had truncation not been considered [11, 12].

Optimization means achieving the best result under certain conditions. Engineers must make many technical and managerial decisions throughout the design, construction, and upkeep of any engineering system. The main purpose of all these decisions is to either minimize the effort or to maximize the benefit. In any situation, the effort required or the amount of benefit can be defined as a function of certain decision variables. Optimization is then the process of choosing the conditions that result in the highest or lowest value of a function [13, 14].

There is no one way that can solve all optimization problems efficiently. Due to this, various optimization methods have been invented to handle many types of optimization problems. These methods are also called mathematical programming and are typically included in the field of operations research. Operations research uses scientific methods to solve problems related to decision making and to find the best solutions [13, 14]. In recent years, new optimization methods have gained popularity and are widely used to solve difficult engineering optimization problems. Such as genetic algorithms, simulated annealing, particle swarm optimization, ant colony optimization, black-winged kite optimization algorithm [13].

The black kite optimization algorithm (BKA) is a meta-heuristic optimization algorithm based on the migration and hunting patterns of the black kite. The BKA brings together the Cauchy mutation technique and the Leader strategy to increase its ability to find global solutions and speed up convergence. This method manages achieves a balance both global and local knowledge to find good solutions [15]. It should be noted that not all problems can be solved by one algorithm. There is no meta-heuristic algorithm that is optimal in all optimization problems. A meta-heuristic algorithm can do very well with certain types of issues, but may not be as successful with others. The advancement of technology and the rise in problem complexity make it hard for some traditional algorithms to solve them [15].

The key shortcoming of parameter estimation in the truncated Poisson regression models can be viewed as the complex effect of the likelihood that is induced by truncation as it may necessitate the employment of the non-linear optimization tools and cause other problems with acquiring consistent and efficient estimators. Truncated Poisson models can be estimated using maximum likelihood in which the likelihood functions are complex with the normalizing constant being dependent upon the truncation limits. That increases the computational burden of parameter estimation and frequently requires non-linear optimization techniques iterative in nature, which have the potential to converge slowly or at local as opposed to global optima [16].

The weaknesses of quasi-Newton algorithms in parameter estimation of the truncated Poisson regression model mostly follow these issues because of the difficulty of the truncated likelihood function, which influence convergence and speed of computation. In the truncated Poisson measuring and computing log-likelihood, the log-likelihood contains a normalizing constant that relies on the truncation that results in the likelihood function to be non-linear and consequently more strenuous compared with that in the conventional Poisson. Quasi-Newton optimization methods do not compute the Hessian explicitly but can become sensitive to this complexity and thus slow to converge or to find global (as opposed to local) maxima. Quasi-Newton optimization can be sensitive to starting parameter values, and is iterative, whereas truncation or small sample size limits often render it insensitive to such start values. Such sensitivity may induce instability or convergence to irreliable parameter estimates [17, 18].

The paper therefore concerns the difficulty of causing the parameters of the truncated Poisson regressions to get the accurate estimation of parameters based on maximum likelihood approach algorithms. In particular, the paper discusses computational and numerical challenges that are associated with the truncation of the information and which change the usual Poisson likelihood by the introduction of black kite optimization algorithm. The complexity typically gets in the way of rapid convergence, arithmetical precariousness and can be biased when using traditional optimization schemes like Newton-Raphson, Gauss-Newton, or perhaps quasi-Newton procedures.

2 Poisson regression

The modeling of count data is based on Poisson regression. It was the earliest model to explicitly model counts and it remains at the foundation of the numerous kinds of count models that can be used by analysts, it is also called log-linear regression. The dependent variable in this model is Poisson distributed and the dependent variable is linked with the independent variables using a log link function which yields a linear equation [4, 6].

The Poisson regression model states that the y_i is a random variable distributed as Poisson with parameter λ_i, which depends on the regressors x_i. The main formula of the model is [19]:

\begin{array}{l} P (Y_{i} = y_{i} | x_{i}) = \frac{e^{- λ} λ^{y_{i}}}{y_{i}!}, y_{i} = 0, 1, 2 \dots, & (1) \end{array}

The most popular formulation for λ_i is the loglinear model

\begin{array}{l} l n λ_{i} = x_{i}^{T} β & (2) \end{array}

where $x_{i}^{T} β = β_{0} + β_{1} x_{i 1} + \dots + β_{n} x_{i n}$ , $x_{i}^{T}$ represents the vector of independent variables and β is the regression coefficients. The expected number of events is given by

\begin{array}{l} E [y_{i} | x_{i}] = V a r [y_{i} | x_{i}] = λ_{i} = e^{x_{i}^{T} β} & (3) \end{array}

The log likelihood function is

\begin{array}{l} l n L = \sum_{i = 1}^{n} [- λ_{i} + y_{i} x_{i}^{T} β - l n y_{i}!] & (4) \end{array}

3 Truncated Poisson distribution

Let y a discrete random variable follow Poisson distribution with mean and variance (λ) then the probability mass function of Poisson distribution is [19, 20]:

\begin{array}{l} P (y_{i}; λ) = \frac{e^{- λ} λ^{y_{i}}}{y_{i}!}, y_{i} = 0, 1, 2, \dots, λ > 0 & (5) \end{array}

A sample is considered truncated if the observations are limited to a certain portion of the population distribution. The truncated distribution is a subset of the untruncated distribution, with truncation occurring either from the left side [left truncated (y > l)] at a point, l, from the right side [right truncated (y < k)] at a point, k, or from both sides within the interval [l, k]. The probability density function of the shortened random variable can be described as a condition a distribution, as demonstrated below [19, 21]:

Case (1): left truncated Poisson at zero (y_i > 0)

The zero-truncated Poisson distribution (ZTP) was first introduced by David and Johnson [36], which is distributed as y ~ ZTP(λ) and it is one of the models of logarithmic linear regression. The probability mass function of the zero truncated Poisson (ZTP) distribution is [3, 22, 23]:

\begin{array}{l} P (y_{i} | y_{i} > 0) = \frac{P (y_{i}; λ)}{Pr [y_{i} > 0]} \\ = \frac{\frac{e^{- λ} λ^{y_{i}}}{y_{i}!}}{1 - Pr [y = 0]} \\ = \frac{e^{- λ} λ^{y_{i}}}{(1 - e^{- λ}) y_{i}!}, y_{i} = 1, 2, 3, \dots & (6) \end{array}

Case (2): right truncated Poisson (y_i ≤ k)

The Poisson distribution becomes right-truncated when truncation occurs at (k) where (y_i ≤ k). The probability mass function of the right truncated Poisson distribution takes the following form [24, 25]:

\begin{array}{l} P (y_{i} | y_{i} \leq k) = \frac{P (y_{i}; λ)}{Pr [y_{i} \leq k]} \\ = \frac{e^{- λ} λ^{y_{i}}}{y_{i}! (\sum_{z = 0}^{k} \frac{e^{- λ} λ^{z}}{z!})} \\ = \frac{λ^{y_{i}}}{(\sum_{z = 0}^{k} \frac{λ^{z}}{z!}) y_{i}!}, y_{i} = 0, 1, 2, \dots, k & (7) \end{array}

Case (3): double truncated Poisson (l ≤ y_i ≤ k)

The Double truncated Poisson data result from merging left truncated and right truncated Poisson data types. The probability mass function of the double truncated Poisson distribution takes the following form [26].

\begin{array}{l} P (y_{i} | l \leq y_{i} \leq k) = \frac{P (x_{i}; λ)}{Pr [l \leq y_{i} \leq k]} \\ = \frac{λ^{y_{i}}}{(\sum_{z = l}^{k} \frac{λ^{z}}{z!}) y_{i}!}, y_{i} = l, l + 1, l + 2, \dots, k & (8) \end{array}

The analysis of right truncation and double truncation for count data has received less scholarly focus compared to left truncation. One possible explanation is that Left-truncation occurs more frequently than right-truncation [26].

4 Truncated Poisson regression model

The truncated Poisson regression model is one of the models of logarithmic linear regression for the dependent variable (y_i) and is defined by the following formula [19, 21, 27, 28]:

\begin{array}{l} y_{i} = e^{x_{i}^{T} β + U_{i}} U ~ P (λ) & (9) \end{array}

$w h e r e x_{i}^{T} β = β_{0} + β_{1} x_{i 1} + \dots + β_{n} x_{i n}$

\begin{array}{l} y_{i} ~ P (λ) \end{array}

The distribution parameter of the response variable (y_i) can be expressed as [29–31]:

\begin{array}{r} λ_{i} = e^{x_{i}^{T} β} \\ l n λ_{i} = x_{i}^{T} β \end{array}

In The truncated Poisson regression model the observations of (y_i, x_i) are obtained only for part of the population. The main goal of regression analysis involves parameter estimation to understand the relationship between dependent variable and independent variables. The maximum likelihood estimator serves to calculate parameter estimates for truncated Poisson regression models.

5 Maximum likelihood estimation (MLE)

The estimation of parameters represents a fundamental research topic that attracts mathematical statistics researchers, because new estimation methods require accurate parameter estimation and optimal estimator identification [32].

Case (1): zero truncated Poisson regression model

The maximum likelihood function for the zero truncated Poisson regression model derives from the conditional probability function of the zero truncated Poisson distribution shown in Equation 6.

\begin{array}{l} L (β | y) = Π_{i = 1}^{n} \frac{e^{- λ} λ^{y_{i}}}{(1 - e^{- λ}) y_{i}!} \\ ln L (β | y) = \sum_{i = 1}^{n} [y_{i} ln (λ) - λ - ln ({1 - e}^{- λ}) - ln (y_{i}!)] \\ ln L (β | y) = \sum_{i = 1}^{n} [y_{i} β - e^{x_{i}^{T} β} - ln ({1 - e}^{- x_{i}^{T} β}) - ln (y_{i}!)] & (10) \end{array}

The zero truncated Poisson maximum likelihood estimators require the derivative of Equation 10 with respect to β to obtain their values as:

\begin{array}{l} \frac{\partial l n L}{\partial β} = \sum_{i = 1}^{n} [y_{i} - \frac{e^{x_{i}^{T} β}}{(1 - e^{- e^{x_{i}^{T} β}})}] x_{i} = 0 & (11) \end{array}

Case (2): right truncated Poisson regression model

The maximum likelihood function for the right truncated Poisson regression model derives from the conditional probability function of the right truncated Poisson distribution shown in Equation 7.

\begin{array}{l} L (β | y) = Π_{i = 1}^{n} \frac{λ^{y_{i}}}{(\sum_{z = 0}^{k} \frac{λ^{z}}{z!}) y_{i}!} \\ ln L (β | y) = \sum_{i = 1}^{n} [y_{i} ln (λ) - ln (y_{i}!) - ln (\sum_{z = 0}^{k} \frac{λ^{z}}{z!})] \\ ln L (β | y) = \sum_{i = 1}^{n} [y_{i} β - ln (y_{i}!) - ln (\sum_{z = 0}^{k} \frac{{(x_{i}^{T} β)}^{z}}{z!})] & (12) \end{array}

We derive maximum likelihood right truncated Poisson regression estimators by taking the first derivative of β and setting it to zero according to the following:

\begin{array}{l} \frac{\partial l n L}{\partial β} = \sum_{i = 1}^{n} [y_{i} - \frac{\sum_{z = 0}^{k} \frac{x_{i} z {(x_{i}^{T} β)}^{z}}{z!}}{\sum_{z = 0}^{k} \frac{{(x_{i}^{T} β)}^{z}}{z!}}] = 0 & (13) \end{array}

Case (3): double truncated Poisson regression model

The maximum likelihood function for the double truncated Poisson regression model derives from the Equation 8.

\begin{array}{l} L (β | y) = Π_{i = 1}^{n} \frac{λ^{y_{i}}}{(\sum_{z = l}^{k} \frac{λ^{z}}{z!}) y_{i}!} \\ ln L (β | y) = \sum_{i = 1}^{n} [y_{i} ln (λ) - ln (y_{i}!) - ln (\sum_{z = l}^{k} \frac{λ^{z}}{z!})] \\ ln L (β | y) = \sum_{i = 1}^{n} [y_{i} β - ln (y_{i}!) - ln (\sum_{z = l}^{k} \frac{{(x_{i}^{T} β)}^{z}}{z!})] & (14) \end{array}

The maximum likelihood double truncated Poisson regression estimators can be determined differentiating Equation 14 with respect to β, giving

\begin{array}{l} \frac{\partial l n L}{\partial β} = \sum_{i = 1}^{n} [y_{i} - \frac{\sum_{z = l}^{k} \frac{x_{i} z {(x_{i}^{T} β)}^{z}}{z!}}{\sum_{z = l}^{k} \frac{{(x_{i}^{T} β)}^{z}}{z!}}] = 0 & (15) \end{array}

Equations 11, 13, 15 contain non-linear relationships between parameters which require iterative methods including Newton Raphson or Fisher scoring. In this research, we employ the traditional optimization method [Quasi newton method (BFGS)] alongside meta heuristic algorithms to estimate parameters of truncated Poisson regression model in three different cases, as well as improving a new algorithm to enhance the solution process.

6 Broyden–Fletcher–Goldfarb– Shanno Method (BFGS)

The BFGS method is one of the quasi-Newton algorithms for unconstrained optimization problem, for finding a point x^* ∈ Rⁿ let:

\begin{array}{l} min f (x) x * \in R^{n} & (16) \end{array}

Where the objective function f:Rⁿ → R is a twice continuously differentiable objective function. The Broyden–Fletcher–Goldfarb–Shanno Method (BFGS) performs an iterative process that follows this procedure [13, 33]:

1. Start with x₁, an initial point and [H₁], a positive definite symmetric matrix n × n. where [H₁] is the identity matrix [I]. Set i = 1 the iteration number.

2. Determine the gradient ∇f_i at point x₁, then set:

\begin{array}{l} S_{i} = - [H_{1}] \nabla f_{i} & (17) \end{array}

3. Determine the optimal step length $λ_{i}^{*}$ moving along direction S_i, then set:

\begin{array}{l} x_{i + 1} = x_{i} + λ_{i}^{*} S_{i} & (18) \end{array}

4. Check if the new point x_i+1 represents an optimal solution. If x_i+1 is optimal, stop. Otherwise, go to step 5.

5. Update the matrix [H₁] as:

\begin{array}{l} [H_{i + 1}] = [H_{i}] + (1 + \frac{g_{i}^{T} [H_{i}] g_{i}}{d_{i}^{T} g_{i}}) \\ \times \frac{d_{i} d_{i}^{T}}{d_{i}^{T} g_{i}} - \frac{d_{i} g_{i}^{T} [H_{i}]}{d_{i}^{T} g_{i}} - \frac{[H_{i}] g_{i} d_{i}^{T}}{d_{i}^{T} g_{i}} & (19) \end{array}

Where

\begin{array}{l} g_{i} = \nabla f (x_{i + 1}) - \nabla f (x_{i}) = \nabla f_{i + 1} - \nabla f_{i} & (20) \end{array}

\begin{array}{l} d_{i} = x_{i + 1} - x_{i} & (21) \end{array}

Set i = i + 1 the new iteration number, and go to step 2.

7 Black winged kites algorithm (BKA)

In (2024) Wang et al. [15] introduced the black-winged kite optimization algorithm (BKA) which represents a groundbreaking meta-heuristic algorithm. The black-winged kite uses its survival techniques as the basis for its optimization algorithm. This bird utilizes excellent hovering capabilities in addition to its unexpected hunting proficiency. The black-winged kite feeds on insects together with birds and reptiles and small mammals. A model was developed through analysis of black-winged kite movement patterns and hunting abilities [15, 34, 35].

7.1 Initialization

The first step in BKA requires generating random solutions to establish the population as depicted in Algorithm 1. Each Black-winged kite receives its position through a uniform distribution:

\begin{array}{l} X_{i} = B K_{l b} + r a n d (B K_{u b} - B K_{l b}) & (22) \end{array}

Where BK_lb, BK_ub represent the lower and upper bounds of i^th black kites, respectively, and rand ∈ [0, 1] is a random number [15, 34, 35].

Algorithm 1

Algorithm 1. Black winged kite algorithm.

7.2 Attacking

The black-winged kite waits quietly before dropping to attack its prey after matching its wings and tail to the wind speed. The black-winged kite underwent two different assault scenarios throughout the global exploration phase of the BKA. The kite maintains its hovering position in the air while it readjusts its position to reach the target at its optimal attack angle. The kite maintains its position in the air while scanning for targets before striking down the most vulnerable one it detects. The attack behavior model uses the following mathematical expression:

\begin{array}{l} x_{t + 1}^{i, j} = {\begin{array}{l} x_{t}^{i, j} + n (1 + sin (r)) \times x_{t}^{i, j} & p < r \\ x_{t}^{i, j} + n \times (2 r - 1) \times x_{t}^{i, j} & e l s e \end{array} & (23) \end{array}

\begin{array}{l} n = 0.05 \times e^{- 2 * {(\frac{t}{T})}^{2}} & (24) \end{array}

Where $x_{t + 1}^{i, j}, x_{t}^{i, j}$ is the position of the i^th Black-winged kites in the j^th dimension at iteration steps (t) and (t + 1)^th, respectively. r ∈ [0, 1] is a random number and p = 0.9 is a constant value. T represents the total number of iterations and t is the current iteration [15, 34, 35].

7.3 Migration

Bird migration occurs as an intricate behavior because both climate conditions and food availability serve as influential environmental elements. Bird migration exists as an adaptation to seasonal changes through which numerous birds move from northern regions to southern areas for improved living conditions and resources. Migration teams follow leaders who need excellent navigation abilities to achieve success. Our hypothesis relies on bird migration principles which state that if the fitness value of the current population is lower than that of the random population, the leader will give up leadership and join the migratory population, indicating that it is not suitable to lead the population forward. On the other hand, if the fitness value of the current population is higher than that of the random population, then the population will be guided to its destination. The approach enables automatic selection of superior leaders to achieve migration success. The mathematical model describes for the migration patterns of black-winged kites as follows:

\begin{array}{l} x_{t + 1}^{i, j} = {\begin{array}{c} x_{t}^{i, j} + C (0, 1) \times (x_{t}^{i, j} - L_{t}^{j}) & F_{i} < F_{r i} \\ x_{t}^{i, j} + C (0, 1) \times (L_{t}^{j} - m \times x_{t}^{i, j}) & e l s e \end{array} & (25) \end{array}

\begin{array}{l} m = 2 \times sin (r + π / 2) & (26) \end{array}

The parameter (m) is used to scale the current position of the kite in the update term, and scales the step size toward the leader perturbed by Cauchy mutation.

Where $L_{t}^{j}$ is the leading scorer of the black-winged kites in the j^th dimension of the t^th iteration so far. F_i is the fitness value of the current position obtained by any black-winged kite in the j^th dimension of the t^th iteration. F_ri is the fitness value of the random position obtained from any black kites in the j^th dimension of the t^th iteration, and C(0, 1) is the Cauchy mutation. The probability density function of the Cauchy distribution is [15, 34, 35]:

\begin{array}{l} f (x, δ, μ) = \frac{1}{π} \frac{δ}{δ^{2} + {(x - μ)}^{2}} - \infty < x < \infty & (27) \end{array}

When δ = 0, μ = 1, then the standard form of the Cauchy distribution becomes the following:

\begin{array}{l} f (x, δ, μ) = \frac{1}{π} \frac{1}{x^{2} + 1} - \infty < x < \infty & (28) \end{array}

8 Proposed algorithm

In this algorithm, the classical optimization algorithm, BFGS, was combined with the black-winged kite optimization algorithm (BFGS-BKA). where the randomness and speed of the black-winged kite optimization algorithm are used to find the optimal step length $λ_{i}^{*}$ in each iteration, while using parameter values before truncated as initial values for BFGS algorithm. The basic steps of this algorithm can be describes as:

Step (1): start with x₁, an initial point and [H₁], a positive definite symmetric matrix n × n. where [H₁] is the identity matrix [I]. Set i = 1 the iteration number.

Step (2): determine the gradient ∇f_i at point x₁, then set:

S_i = −[H₁]∇f_i

Step (3): find the optimal step length $λ_{i}^{*}$ by black-winged kite optimization algorithm:

1. Randomly generate the initial population of black-winged kite between λ_lb and λ_ub as λ₁, λ₂, ........, λ_N. Evaluate the fitness value of each black-winged kite as (λ₁), f(λ₂), ......, f (λ_N). Set t = 1 the iteration number.

2. BKA chooses the individual with the best fitness value to become the leader λ_L in the initial population.

3. Find f_best and λ_L, the black-winged kite algorithm initiates its global exploration and search during its attack behavior according the following equation:

\begin{array}{l} λ_{t + 1}^{i, j} = {\begin{array}{c} λ_{t}^{i, j} + n (1 + sin (r)) \times λ_{t}^{i, j} & p < r \\ λ_{t}^{i, j} + n \times (2 r - 1) \times λ_{t}^{i, j} & e l s e \end{array} \end{array}

4. In Bird migration, will be the position is update based on the fitness value of the leader as:

\begin{array}{l} λ_{t + 1}^{i, j} = {\begin{array}{c} λ_{t}^{i, j} + C (0, 1) \times (λ_{t}^{i, j} - L_{t}^{j}) & F_{i} < F_{r i} \\ λ_{t}^{i, j} + C (0, 1) \times (L_{t}^{j} - m \times λ_{t}^{i, j}) & e l s e \end{array} \end{array}

5. Test the convergence of the current solution. If the convergence criterion is not satisfied, go to step (3) and update the iteration number as t = t + 1, until convergence occurs and the optimal value of λ is determined.

Step (4): find new point:

\begin{array}{l} x_{i + 1} = x_{i} + λ_{i}^{*} S_{i} \end{array}

Step (5): check if the new point x_i+1 represents an optimal solution. If x_i+1 is optimal, stop. Otherwise, go to step 6.

Step (6): update the matrix [H₁] as:

\begin{array}{l} [H_{i + 1}] = [H_{i}] + (1 + \frac{g_{i}^{T} [H_{i}] g_{i}}{d_{i}^{T} g_{i}}) \frac{d_{i} d_{i}^{T}}{d_{i}^{T} g_{i}} - \frac{d_{i} g_{i}^{T} [H_{i}]}{d_{i}^{T} g_{i}} - \frac{[H_{i}] g_{i} d_{i}^{T}}{d_{i}^{T} g_{i}} \end{array}

Where

\begin{array}{l} g_{i} = \nabla f (x_{i + 1}) - \nabla f (x_{i}) = \nabla f_{i + 1} - \nabla f_{i} \\ d_{i} = x_{i + 1} - x_{i} \end{array}

Step (7): set i = i + 1 the new iteration number, and go to step 2.

9 Real data

This section contains real dataset to demonstrate the empirical importance of the algorithm (BFGS-BKA), where the real dataset is for the zero truncated Poisson regression model study. We use the (Arizona MedPar database, 1991). The dataset contains 1,495 observations the response variable, Lose, represents length of hospital stay. The explanatory variables for this model include an indicator of White (Patient identifies themselves as Caucasian, binary), Hmo (Patient belongs to a Health Maintenance Organization, binary), Type2 (Urgent admission, binary) and Type3 (Elective admission, binary).

As illustrated by Table 1, across all four methods, the parameter estimates are remarkably consistent, indicating stable estimation despite the choice of optimization algorithm. The numerical differences between the methods on each parameter are very small showing that all methods converge to nearly identical solutions. The MSE measures the average squared difference between observed and predicted values, serving as an indicator of model fit quality and estimator accuracy. Newton Raphson, Davidon–Fletcher–Powell (DFP) method, and BFGS yield the same MSE value of 0.0225, whereas the BFGS-BKA method achieves a slightly lower MSE of 0.0211. This implies that integrating the BKA algorithm with BFGS potentially enhances estimation accuracy or predictive performance. This improvement indicates that BFGS-BKA may better navigate the complex truncated likelihood surface to find a more optimal parameter set or avoid local optima.

Table 1

Table 1. Results zero-truncated Poisson regression model for the medpar data.

10 Simulation study

This section presented a simulation study to demonstrate the empirical importance of the algorithm (BFGS-BKA) to evaluate the accuracy of MLEs of parameters estimation of a truncated Poisson regression model of three cases left truncated Poisson regression model (LTPRM), right truncated Poisson regression model (RTPRM) and double truncated Poisson regression model (DTPRM). We consider dimension P = 3, 7, 12 and the sample sizes n = 25, 50, 100, 200.

10.1 Simulation results of the left truncated Poisson regression model

This section represented results for fit the left truncated Poisson regression model, with responses truncated at L = 0. Where independent variables are randomly generated according to a normal distribution and the data were simulated with true values beta = [0.5; −0.3; 0.2], beta = [0.5; −0.3; 0.2; 0.1; −0.1; 0.3; −0.2] and beta = [0.5; −0.3; 0.2; 0.1; −0.1; 0.3; −0.2; 0.4; −0.25; 0.15; 0.05; −0.05]. Tables 2–4 show the simulation results for LTPRM.

Table 2

Table 2. Simulation results for estimating parameters of a left truncated Poisson regression model at zero (y > 0; P = 3).

Table 3

Table 3. Simulation results for estimating parameters of a left truncated Poisson regression model at zero (y > 0; P = 7).

Table 4

Table 4. Simulation results for estimating parameters of a left truncated Poisson regression model at zero (y > 0; P = 12).

10.2 Simulation results of the right truncated Poisson regression model

This section represented results for fit the right truncated Poisson regression model, with responses truncated at U = 5. Where independent variables are randomly generated according to a normal distribution using matlab randn (n,p), also the data were simulated with true values beta = [0.5; −0.3; 0.2], beta = [0.5; −0.3; 0.2; 0.1; −0.1; 0.3; −0.2] and beta = [0.5; −0.3; 0.2; 0.1; −0.1; 0.3; −0.2; 0.4; −0.25; 0.15; 0.05; −0.05]. Tables 5–7 show the simulation results for (RTPRM).

Table 5

Table 5. Simulation results for estimating parameters of a right truncated Poisson regression model (P = 3).

Table 6

Table 6. Simulation results for estimating parameters of a right truncated Poisson regression model (P = 7).

Table 7

Table 7. Simulation results for estimating parameters of a right truncated Poisson regression model (P = 12).

10.3 Simulation results of the double truncated Poisson regression model

This section represented results for fit the double truncated Poisson regression model, with responses truncated at L = 0 and U = 5. Where Independent variables are randomly generated according to a normal distribution using matlab randn (n,p), also the data were simulated with true values beta = [0.5; −0.3; 0.2], beta = [0.5; −0.3; 0.2; 0.1; −0.1; 0.3; −0.2] and beta = [0.5; −0.3; 0.2; 0.1; −0.1; 0.3; −0.2; 0.4; −0.25; 0.15; 0.05; −0.05]. Tables 8–10 show the simulation results for (DTPRM).

Table 8

Table 8. Simulation results for estimating parameters of a double truncated Poisson regression model (P = 3).

Table 9

Table 9. Simulation results for estimating parameters of a double truncated Poisson regression model (P = 7).

Table 10

Table 10. Simulation results for estimating parameters of a double truncated Poisson regression model (P = 12).

From all the Tables above, we can concluded that across all sample sizes, parameter estimates from all four methods are highly similar, demonstrating consistent convergence to approximately the same values. This consistency suggests each optimization method is capable of locating reliable parameter estimates even in smaller samples. As sample size increases, the parameter estimates stabilize and vary less across methods. This reflects the expected property of maximum likelihood estimators: increased sample size yields more precise and stable estimates. As sample size increases (from 25 to 200), the estimated MSE decreases monotonically for all methods, demonstrating improved estimation precision with larger data consistent with statistical theory. BFGS-BKA consistently achieves the lowest MSE at each sample size, indicating superior estimation accuracy. Classical methods such as Newton-Raphson, DFP, and BFGS provide similar MSE values, slightly higher than BFGS-BKA. The performance gap, though sometimes small in absolute terms, illustrates the advantage of using the Black Kite Optimization (BKA) to enhance the classical BFGS method, improving optimization over the complex truncated likelihood surface. The similarity of parameter estimates across methods confirms robustness of the numerical algorithms when applied to truncated Poisson regression models.

Integrating the Black Kite Optimization algorithm with BFGS consistently improves the optimization process, yielding more accurate and stable estimates. This suggests that metaheuristic approaches like BKA help avoid local optima and improve convergence speed when dealing with truncated likelihood functions. For small sample sizes where numerical instability and local optima are more problematic, BFGS-BKA provides meaningful improvements. For larger datasets, while all methods perform well, BFGS-BKA still maintains a measurable edge in accuracy.

Figures 1–3 show the performance of Newton's, DFP, BFGS, and BFGS-BKA algorithms in terms of time that represents the average computation time (in seconds) used by the all algorithms in the case of successful runs. We can see how the proposed algorithm is better than the others because of the Cauchy mutation that causes better exploration and the leader strategy that causes faster convergence leading to a balance between global and local search.

Figure 1

Three bar graphs compare the execution time, in seconds, of different methods for various sample sizes. Graph (a) compares the Newton, DFP, BFGS, and BFGS-BKA methods for P=3; graph (b) does the same for P=7; and graph (c) for P=12. In each graph, the methods are represented by different colored bars, showing trends and differences in execution time as sample size increases from 25 to 200.

Figure 1. Computational time of left truncated Poisson regression model at zero (y > 0) (a) P = 3, (b) P = 7 and (c) P = 12.

Figure 2

Three bar charts compare execution times of four optimization methods (Newton, DFP, BFGS, BFGS-BKA) over different sample sizes. Chart (a) for P=3, (b) for P=7, and (c) for P=12, with execution time increasing consistently as sample sizes increase from 25 to 200. Each method's performance is visually distinguished by color.

Figure 2. Computational time of right truncated Poisson regression model (a) P = 3, (b) P = 7 and (c) P = 12.

Figure 3

Three bar charts compare execution times using four methods: Newton, DFP, BFGS, and BFGS-BKA. Chart (a) shows results for P=3; (b) for P=7; (c) for P=12. Execution time increases with sample size across all methods.

Figure 3. Computational time of double truncated Poisson regression model (a) P = 3, (b) P = 7 and (c) P = 12.

11 Conclusions

This paper addressed the critical challenge of accurately estimating parameters in truncated Poisson regression models, where standard maximum likelihood estimation is complicated by a truncated likelihood function that includes a non-trivial normalizing constant. The paper proposed the use of the BKO algorithm, a metaheuristic inspired by the hunting and migratory behavior of black kites, which aims to enhance exploration and exploitation capabilities when searching the parameter space. By leveraging BKO, the study seeks to improve the stability, convergence speed, and accuracy of parameter estimates in truncated Poisson models. Simulation studies and empirical analyses included in the paper demonstrate the superior performance of this approach compared to classical optimization methods, highlighting its potential as a robust and efficient solution for parameter estimation in truncated count data settings.

Data availability statement

The data analyzed in this study is subject to the following licenses/restrictions: the data will be under request. Requests to access these datasets should be directed to Zakariya Algamal, emFrYXJpeWEuYWxnYW1hbEB1b21vc3VsLmVkdS5pcQ==.

Author contributions

GB: Conceptualization, Formal analysis, Software, Writing – original draft, Writing – review & editing. SW: Conceptualization, Formal analysis, Methodology, Resources, Writing – original draft, Writing – review & editing. ZA: Supervision, Validation, Writing – review & editing.

Funding

The author(s) declared that financial support was not received for this work and/or its publication.

Conflict of interest

The author(s) declared that this work was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Generative AI statement

The author(s) declared that generative AI was not used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Ahmed Alangood HN, Algamal Z, Khaleel MA. Variable Selection in Poisson Regression Model based on Chaotic Meta-Heuristic Search Algorithm. BIO Web of Conferences, 97 (2024). doi: 10.1051/bioconf/20249700161

Crossref Full Text | Google Scholar

2. Algamal Z. Variable Selection in Count Data Regression Model based on Firefly Algorithm. Statistics, Optimization and Information Computing, 7 (2019). doi: 10.19139/soic.v7i2.566

Crossref Full Text | Google Scholar

3. Alwani ZIZ, Ibrahim AI, Yunus RM, Yusof F. Application of zero-truncated count data regression models to air-pollution disease. J Phys Conf Ser. (2021) 1988:012096. doi: 10.1088/1742-6596/1988/1/012096

Crossref Full Text | Google Scholar

4. Hilbe J-M. Modeling-Count-Data. Cambridge: Cambridge University Press (2014). doi: 10.1017/CBO9781139236065

Crossref Full Text | Google Scholar

5. Hawa NS, Mustafa MY, Kibria BMG, Algamal ZY. Bootstrap Liu-type estimator for Conway-Maxwell-Poisson regression model. Commun Stat Simul Comput. (2025) 1–12. doi: 10.1080/03610918.2025.2462680

Crossref Full Text | Google Scholar

6. Kara R, Yeşilova A. Zero truncated models in regression analysis: an examination of their advantages on small mean values. J Inst Nat Appl Sci. (2025) 30:102–12. doi: 10.53433/yyufbed.1590611

Crossref Full Text | Google Scholar

7. Alkhateeb A, Algamal Z. Jackknifed liu-type estimator in Poisson regression model. J Iran Stat Soc. (2020) 19:21–37. doi: 10.29252/jirss.19.1.21

Crossref Full Text | Google Scholar

8. Liu Y, Li W, Zhang X. A marginalized zero-truncated Poisson regression model and its model averaging prediction. Commun Math Stat. (2025) 13:527–70. doi: 10.1007/s40304-022-00312-8

Crossref Full Text | Google Scholar

9. Wani MK, Ahmad PB. One-inflated zero-truncated Poisson distribution: statistical properties and real life applications. Ann Data Sci. (2025) 12:639–66. doi: 10.1007/s40745-024-00526-3

Crossref Full Text | Google Scholar

10. Young DS, Roemmele ES, Yeh P. Zero-inflated modeling part I: traditional zero-inflated count regression models, their applications, and computational tools. WIREs Comput Stat. (2022) 14:e1541. doi: 10.1002/wics.1541

Crossref Full Text | Google Scholar

11. Qi Y, Yu-Zhu T, Yi-Jing Z, Yue W, Zhi-Bao M. An ordinal collaboration network model with zero truncated poisson latent variables and its application. Stat (2025) 14:e70040. doi: 10.1002/sta4.70040

Crossref Full Text | Google Scholar

12. Ünlü HK, Young DS, Yigiter A, Hilal Özcebe L. A mixture model with Poisson and zero-truncated Poisson components to analyze road traffic accidents in Turkey. J Appl Stat. (2022) 49:1003–17. doi: 10.1080/02664763.2020.1843610

PubMed Abstract | Crossref Full Text | Google Scholar

13. Rao SS. Engineering Optimization Theory and Practice. Hoboken, NJ: John Wiley & Sons, Inc. (2009).

Google Scholar

14. Yu X, Gen M. Introduction to Evolutionary Algorithms. London: Springer-Verlag London Limited (2010). doi: 10.1109/ICCIE.2010.5668407

Crossref Full Text | Google Scholar

15. Wang J, Wang WC, Hu XX, Qiu L, Zang HF. Black-winged kite algorithm: a nature-inspired meta-heuristic for solving benchmark functions and engineering problems. Artif Intell Rev. (2024) 57:98. doi: 10.1007/s10462-024-10723-4

Crossref Full Text | Google Scholar

16. Zhou H, Alexander D, Lange K. A quasi-Newton acceleration for high-dimensional optimization algorithms. Stat Comput. (2011) 21:261–73. doi: 10.1007/s11222-009-9166-3

PubMed Abstract | Crossref Full Text | Google Scholar

17. Jameel MS, Basheer GT, Al-Bayati AY, Algamal ZY. Parameter estimation of a truncated regression model based on improving numerical optimization algorithms. J Phys Conf Ser. (2021) 1897:012059. doi: 10.1088/1742-6596/1897/1/012059

Crossref Full Text | Google Scholar

18. Hwang WH, Stoklosa J, Wang CY. Population size estimation using zero-truncated Poisson regression with measurement error. J Agric Biol Environ Stat. (2022) 27:303–20. doi: 10.1007/s13253-021-00481-z

PubMed Abstract | Crossref Full Text | Google Scholar

19. Green W. Econometric Analysis. 5th ed. Upper Saddle River, NJ: Prentice Hall (2003).

Google Scholar

20. Demaris A. Regression with Social Data Modeling Continuous and Limited Response Variables. New York, NY: John Wiley & Sons, Inc. (2004). doi: 10.1002/0471677566

Crossref Full Text | Google Scholar

21. Heij C, de Boer P, Franses PH, Kloek T, van Dijk HK. Econometric Methods with Applications in Business and Economics. Oxford: Oxford University Press (2004).

Google Scholar

22. Umar MA, Jimoh K, Yahya WB. A Note on the Applications of some Zero Truncated. Professional Statisticians Society of Nigeria, 3 (2019).

Google Scholar

23. Li X, Sun Y, Tian G, Liang J, Shi J. Mean regression model for the zero-truncated Poisson distribution and its generalization. Comput Stat Data Anal. (2023) 179:107650. doi: 10.1016/j.csda.2022.107650

Crossref Full Text | Google Scholar

24. Rahman S. Truncated Distributions and their Applications, 2004-2005 (2004).

Google Scholar

25. Mwandigha LM, Fraser KJ, Racine-Poon A, Mouksassi MS, Ghani AC. Power calculations for cluster randomized trials (CRTs) with right-truncated Poisson-distributed outcomes: a motivating example from a malaria vector control trial. Int J Epidemiol. (2020) 49:954–62. doi: 10.1093/ije/dyz277

PubMed Abstract | Crossref Full Text | Google Scholar

26. Suaiee AM. A Double Truncated Poisson Regression Model with Random Effects. Greeley, CO: University of Northern Colorado (2013).

Google Scholar

27. Karlsson M. Estimators of regression parameters for truncated and censored data. Metrika (2006) 63:329–41. doi: 10.1007/s00184-005-0023-x

Crossref Full Text | Google Scholar

28. Newey WK. Conditional moment restrictions in censored and truncated regression models. Econ Theory (2001) 17:863–88. doi: 10.1017/S0266466601175018

Crossref Full Text | Google Scholar

29. Irshad M, Chesneau C, Shibu DS, Monisha M, Maya R. Lagrangian zero truncated Poisson distribution: properties regression model and applications. Symmetry (2022) 14:1775. doi: 10.3390/sym14091775

Crossref Full Text | Google Scholar

30. Sabri Al-zubaidi M, Abdulah EK. Comparison of estimation methods for zero truncated poisson regression model. J Econ Admin Sci. (2024) 30:492–508. doi: 10.33095/7bt1f714

Crossref Full Text | Google Scholar

31. Kumar S, Singh BP, Madhusudan JV. Factors influencing migration: application of truncated poisson regression. Int J Adv Sci Technol. (2020) 29:9112–20.

Google Scholar

32. Hussain EA, Al-Shallawi ANS, Saied HA. Using maximum likelihood method to estimate parameters of the linear regression T truncated model. NTU J Pure Sci. (2022) 4:26–34. doi: 10.56286/ntujps.v1i4.313

Crossref Full Text | Google Scholar

33. Luksan L, Spedicatob E. Variable metric methods for unconstrained optimization. J Comput Appl Math. (2000) 124:61–95. doi: 10.1016/S0377-0427(00)00420-9

Crossref Full Text | Google Scholar

34. Zhang Z, Wang X, Yue Y. Heuristic optimization algorithm of black-winged kite fused with osprey and its engineering application. Biomimetics (2024) 9:595. doi: 10.3390/biomimetics9100595

PubMed Abstract | Crossref Full Text | Google Scholar

35. Zhao M, Su Z, Hua Z, Zhao C. Improved black-winged kite algorithm based on chaotic mapping and adversarial learning. J Phys Conf Ser. (2024) 2898:012040. doi: 10.1088/1742-6596/2898/1/012040

Crossref Full Text | Google Scholar

36. David FN, Johnson NL. The truncated poisson. Biometrics. (1952) 8:275–85.

Google Scholar

Keywords: BFGS, black kite optimization algorithm, count data, meta-heuristic optimization algorithms, truncated Poisson regression model

Citation: Basheer GT, Waleed Mahmood S and Algamal ZY (2026) Improving parameters estimation of a truncated Poisson regression model based on meta-heuristic optimization algorithms. Front. Appl. Math. Stat. 12:1744058. doi: 10.3389/fams.2026.1744058

Received: 11 November 2025; Revised: 09 January 2026;
Accepted: 12 January 2026; Published: 04 February 2026.

Edited by:

Appanah Rao Appadu, University of the Western Cape, South Africa

Reviewed by:

Yannick Tangman, University of Mauritius, Mauritius
Faiza Sami, Govt Gordon Graduate College Rawalpindi, Pakistan

Copyright © 2026 Basheer, Waleed Mahmood and Algamal. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zakariya Yahya Algamal, emFrYXJpeWEuYWxnYW1hbEB1b21vc3VsLmVkdS5pcQ==

^†ORCID: Zakariya Yahya Algamal orcid.org/0000-0002-0229-7958

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.