- School of Computer Engineering, Kalinga Institute of Industrial Technology (KIIT) Deemed to be University, Bhubaneswar, Odisha, India
Introduction: This work presents an adaptive ant colony (AdCO) framework for dynamic task management in heterogeneous Non-Terrestrial Network–Internet of Things (NTN-IoT) systems integrating Unmanned Aerial Vehicles (UAVs) and Low Earth Orbit (LEO) satellites. The framework addresses key challenges such as stochastic mobility, intermittent connectivity, and latency-sensitive operations common in large-scale IoT deployments.
Methods: The proposed approach employs adaptive pheromone learning, heuristic control, and multi-timescale scheduling. It follows a hierarchical co-optimization strategy, where UAV swarms perform edge-side task allocation while LEO satellites handle relay scheduling during orbital passes. Event-triggered pheromone resets and distributionally robust cost modeling are introduced to maintain stability and adaptability under dynamic network conditions.
Results: Simulation results demonstrate superior performance compared to classical Ant Colony Optimization (ACO) and recent meta-heuristic methods. The proposed model achieves higher task completion ratios, reduced end-to-end latency, and enhanced energy-normalized throughput across different orbital configurations, traffic patterns, and link failures.
Discussion: The findings confirm the efficiency and resilience of the proposed framework in NTN-IoT operations. Its adaptability makes it suitable for critical applications such as disaster response, precision agriculture, and maritime monitoring, where real-time coordination and reliability are essential.
1 Introduction
Non-terrestrial IoT (NTN) enhances the terrestrial based internet of things (IoT) using unmanned aerial vehicle (UAV) swarms and LEO satellite constellations. It is showing growing potential as a scalable technology for wide-area IoT applications. NTN provides an opportunity for on-demand sensing, in-situ edge computing and beyond-line-of-sight data relaying in areas with such limited or disrupted terrestrial infrastructure (Kahraman et al., 2023). However, several challenges exist. Dynamic link topology, time-varying link budgets, fragmented orbital pass windows, and the trade-offs between energy, latency, and reliability render global coordinated tasking across multiple timescales a challenging optimization problem. Classical swarm intelligence techniques like ant colony optimization (ACO) can cope with static or slowly changing routing problems. However, the fixed parameterization and the narrow detection range of changes usually leads to pheromone stagnation or late re-planning when the environment changes suddenly (Deng et al., 2022). To address this issue, an adaptive colony (AdCO) optimization model is presented to tackle these shortcomings. The work has been compared with three kinds of coordinated approaches: (i) standard ACO with fixed powers and evaporation, (ii) bounded-pheromone variants (such as MAX-MIN ACO for better control of exploitation), and (iii) a policy heuristic scheduler that combines deadline-aware prioritization and proximity-based assignment (Zhang et al., 2025). This analysis decouples the impact of the event-triggered adaptation, the connectivity-aware heuristics, and the robust cost shaping on timeliness, energy efficiency, and tail-latency robustness for realistic UAV and low Earth orbit (LEO) dynamics.
Combining LEO satellite constellations with UAV Swarm provides a solution to resilient and large area IoT service. Such integration is needed in territories where the terrestrial infrastructure is not possible, or is not reliable, or is too costly (Mai et al., 2023). Disaster-stricken areas, marine corridors, farming fields and isolated industry are all examples of such settings. In such scenarios, sensing tasks are spatio-temporarily distributed and bursty. Opportunities for connectivity are subject to orbital visibility, the condition of the atmosphere and availability of spectrum (Jubair et al., 2025). UAVs provide mobility and fast deployment as well as edge computing, but are subject to limitations in energy storage, flight dynamics, and onboard computing power. On one hand, LEOs satellites are able to provide wide-area relays and backhaul services, while on the other hand, they have a sporadic contact window, the ability to induce velocity Doppler variations and competition for bandwidth with traffic classes. Controlling both the tasks’ execution and the data delivery over UAV and LEO layers is achieved by managing two synchronized dynamical processes (Zhu et al., 2023). The first refers to spatiotemporal task allocation and motion planning in UAV network. The second such is contact-aware relay scheduling that adapts with the varying LEO pass structure. Figure 1 describes the system architecture that includes UAVs, LEOs and ground segments, the data flows, the scheduling layers and the control loop.
In this system, the decision space is combinatorial and non-stationary. Estimates of task priority change with time, shifting according to deadlines and arrivals. Quality varies with elevation angles (EAs) and mobility directions. The buffer occupancy, the energy level, and the computational state all vary on different time scales as well. There have been mostly two types of approaches to managing these interacting dynamics: classical (suboptimal) techniques, including static task assignment, single-timescale routing, and topology-agnostic heuristics, and optimal solutions with simplified and static dynamics. They often lead to a tradeoff between high re-planning overhead, and performance loss due to out-of-date policies (Li et al., 2025b). The operational goals are multi-criteria in the sense. These objectives involve low end-to-end latency (i.e., long life time regardless of the number of deadlines missed), low deadline misses, strict energy consumption, and link quality in terms of uncertain channel conditions and outages (Hasanain et al., 2025). Partial observability further complicates the problem. Ephemeris predictions or channel estimates can be frequently unreliable. Sudden changes in the working environment (e.g., sudden task bursts or unexpected satellite pass losses) also account for another challenge (Peng et al., 2025). So, an efficient coordinator must be able to provide a fast decision-making on the fly and be adaptive and robust. It has to support immediate reconfiguration and be free of local oscillations such that low performance is not realized in very dynamic and resource-constrained NTN-IoT scenarios with very low end-to-end latency sensitivity (e.g., UAV deployment) (Behjati et al., 2025).
1.1 Research gap and proposed contribution
Existing work falls into three key categories to leverage UAV satellite communications for IoT coordination, but this existing work suffers from the challenges of fast changing NTN conditions (Gu et al., 2023). The first one is of classical swarm-based optimizers like fixed-parameter ACO and bounded-pheromone variants. Such methods work well for combinatorial search in static or slowly varying routing and task allocation. But they have poor adaptability to the scripting topology changing and sudden contact loss and bursty task arrival time. Pheromones stagnate and do not recover in time when operation conditions experience a large change (Lozano-Cuadra et al., 2025). The second type consists of heuristic schedulers which mix deadline-aware rules with location-oriented or load-balancing heuristics. These methods have low computation complexity, however, these methods do not have any capability to predict the link variation, led by the visibility. As a consequence, their performance deteriorates when LEO contact opportunities or channel conditions are not as good as expected (Lozano-Cuadra and Soret, 2024). The third type is learning-based methods, such as the reinforcement learning. These methods are adaptive under non-stationary environments but require sufficient amounts of training data and are not capable of dealing with the distribution shift between different orbits and terrains. They could also be inadequate to satisfy real-time decision making criteria in the absence of watching for sample efficiency and safety guarantees (Maric et al., 2025). However, three important gaps persist. First, it is relatively lacking in the cross-timescale coordination support to optimize UAV task scheduling and LEO relay scheduling. Second, tail risks and outages are not fully taken into account in the objective formulation in the current methods. Third, the majority of existing algorithms do not include events mechanisms, which keep the population from stagnation while letting the system converge in fast changing environments (Panaitopol et al., 2023).
In this respect, the proposed AdCO is designed to fill this gap by introducing three main differences as introduced below with respect to existing methods. The latter integrates a fast loop for sequencing the UAV tasks, and a slower loop to solve the LEO relay scheduling problem. This temporal alignment adjusts the decision making rate with the physical dynamics of the UAV mobility and satellite pass windows (Yosri et al., 2024). Secondly, it uses event-triggered partial pheromone resets using online drift detection. This mechanism allows to explore again in a controlled way in case of contact geometry or of traffic patterns change and at the same time to retain useful historical information. Third, it integrates distributionally robust cost shaping with risk-sensitive penalties. These punishments encourage meeting deadlines instead of reliability in unreliable links, which ultimately reduces tail latency and failures due to outages rather than serving the mean (Liu et al., 2024). These features combined return a coordinator, which is aware of the connectivity, risk-sensitive, in addition to being able to remain responsive. Experimental validation compares AdCO against three known baselines: fixed-parameter ACO, MAX-MIN ACO, and an heuristic scheduler with a combination of deadlines and proximity. The numerical findings quantitatively justify the advantages of multi-time scale learnability, event-triggered restoration, and robust objective shaping on the fly in dynamic NTN-IoT regimes (Bazrafkan et al., 2025).
This study is conducted entirely within a controlled simulation environment by setting using standards and repeatable components. Orbit dynamics and visibility windows are calculated based on publicly accessible Two-Line Element (TLE) datasets. These are propagated using an Simplified General Perturbations 4 (SGP4) model to produce time-stamped elevation and Doppler and line-of-sight contact profiles over the target region (Kohani and Zong, 2019). The mobility of UAVs is expressed by a 3D kinematic model, which takes into account energy-aware propulsion curves. These are the task fields, obtained from spatial point processes with intensities which are configurable and can be either clumped or uniform. The model has the flexibility to tune the tasks’ arrival rate, deadlines, data size and computational workload (Zhao et al., 2021). The communications layer utilizes a path loss model in the presence of elevation-dependent path with log-normal shadowing and stochastic outage model. This setup simulates UAV-LEO uplinks and satellite ground downlinks. Furthermore, capacity limitations and sharing are modelled with rate envelopes that depend on time.
The paper is organized into seven sections to maintain clarity and cohesion. Section 1 introduces the motivation for integrating UAV swarms with LEO satellites, emphasizing coordination challenges and research gaps addressed by the adaptive colony optimization framework. Section 2 reviews related work on UAV task allocation, integrated networks, colony optimization strategies, heuristic schedulers, and learning-based coordinators, while highlighting their limitations under dynamic NTN conditions. Section 3 presents the system model and problem formulation, defining entities, visibility and capacity constraints, mobility and energy models, traffic parameters, decision variables, and the distributionally robust objective. Section 4 describes the proposed method, including solution encoding, connectivity-aware heuristics, adaptive transition rules, event-triggered pheromone resets, and the bi-level scheduling approach. Section 5 details the simulation environment, experimental setup, baseline methods, evaluation metrics, and reproducibility settings. It also reports results and discussion, comparing the proposed method with representative coordinators and presenting ablation studies under outages, workload bursts, and contact variability. Finally, Section 6 concludes with key findings, practical implications, limitations, and directions for future research.
2 Literature review
Early research on UAV swarm task allocation primarily relied on centralized optimization methods such as mixed-integer linear programming, Hungarian algorithms, and auction-based mechanisms (Pham et al., 2020). Traditional approaches, including the Consensus-Based Bundle Algorithm and its variants, performed well in static settings with predetermined task sets and known agent capabilities. Deterministic methods offered mathematically guaranteed convergence and optimality for problems with fixed constraints (Wang et al., 2024). Hungarian-based strategies, such as the Decentralized Hungarian Algorithm, further ensured efficient polynomial-time complexity for bipartite matching structures (Li et al., 2025a). However, these classical techniques face major challenges in dynamic and non-terrestrial environments. Their static parameterization makes them ill-suited for handling time-varying topologies, intermittent connectivity, and stochastic task arrivals (Chen et al., 2018). Moreover, computational costs grow rapidly with swarm size and task complexity, limiting real-time applicability. Centralized designs also introduce single points of failure and demand extensive inter-agent communication, which is impractical in bandwidth-constrained UAV–LEO satellite operations.
Drawbacks of classical optimization techniques have encouraged the utilization of metaheuristic heuristic methods for UAV task assignment also. It was established that ACO, Particle Swarm Optimization (PSO), and Genetic Algorithms (GA) were the mostly used approaches in the existing study (Divband Soorati et al., 2019). Where traditional ACO implementations have met with considerable success in solving combinatorial routing problems using pheromone-based communication and distributed decision making to arrive at near-optimal solutions (Bonyadi et al., 2014). In the same vein, PSO variants including multi-objective and hybrid models were effective in handling conflicting demands such as energy efficiency and deadline satisfaction in Zhu et al. (2024). These swarm intelligence approaches are by nature adapted to parallelism, scalability and robustness against the failure of individual members. Their search strategy in a probabilistic framework encourages exploration of various solution spaces, which gradients methodologies tend to fall into local minima. However, classic metaheuristics are still characterized by static parameterization, which restrains versatility in dynamic operational conditions (Zhang et al., 2024). These fixed rates often deteriorate their performance under changes, for they may find themselves stuck in prematurely converged or stagnant state. In addition, the absence of adaptation strategies considering the environment prevents these solutions from dealing with regime shifts, orbital pass variability, and non-stationary traffic demands. Among the latest ones, parameter-adaptive ACO with dynamic hybrid mechanisms attempts to largely overcome this issue but still acts more reactively than predictively (Pirandola, 2021).
Recent investigations focus on the increasing use of machine learning in realizing enhanced UAV swarm coordination. This is where the inhomogeneous reinforcement learning comes in. The multi agent reinforcement learning provides the means for UAVs to learn the collaboration behavior by the trial and error interactions. These mechanisms permit adjustments to be made to changing environmental conditions without reprogramming (De Forges de Parny et al., 2023). Recent works utilizing Deep-Q Networks and Actor Critic based algorithms have achieved competitive performance in rescue and target tracking, and area coverage missions. Methods based on learning work well in environments where only a partial view of the world is available, in non stationary conditions, and with complex multi agent interactions. They are ideal for long-term missions with changing objectives as they have the capacity of continuous learning and adaptation (Miller, 2025). Reinforcement learning libraries could also support more complex state representations, where graph like observations and attention mechanisms are used to represent inter agent relations and dynamic interactions with the environment. These methods, however, are confronted with sample efficiency issues. They need large amount of training data, which is scarce in safety-critical applications (Zengshan et al., 2025). Due to black-box policies, verification and certification used in aerospace side are challenged also. Convergence guarantees are often weak or non-existent and initial deployment may involve sub-optimal or unsafe behaviour during exploration (Soret et al., 2024).
Distributed optimization methods have been proposed as an intermediate option between the centralized control and the full autonomy. These techniques use consensus algorithms, game theory, distributed constraints optimization, for the management of the coordination of the UAV swarm (Melillos et al., 2019). The Distributed Greedy Bundles Algorithm is a substantial advance for submodular maximization for MAS. It has formal and approximation guarantees while working in polynomial time (Wang et al., 2021). Game theoretic models unify UAV interactions as two basic or general games in order to search for the Nash equilibrium or generalised Nash equilibrium with resource constraints. Event-activated communication schemes reduce the overall usage of bandwidth, while maintain convergence properties and this is useful in UAV to satellite networks with intermittent connectivity (Li C. et al., 2021). These decentralised methods provide formal convergence guarantees, handle agent heterogeneity well, and scale well to swarm size. Moreover, the decentralized nature of these SLAMs make them more capable of handling communication failures and agent drops, both of which are prevalent in aerial agent operation. However, methods relying on consensus often converge slowly due to the needs to have multiple rounds of communication to reach an agreement (Li Y. et al., 2021). Game theoretic solutions can lead to inefficient equilibria, in particular in adversarial settings. Furthermore, the rationality assumption about the agents behavior may not be satisfied due to hardware faults, communication delays or adversarial environments.
The development of the uncertainty effects of UAV satellite scenarios has resulted in growing efforts to address robust optimization and distributionally robust optimization (Stodola et al., 2025). Polyhedral convex sets are used to describe parameter uncertainties in uncertain resource allocation frameworks. Such frameworks take care of errors due to channel estimation, inaccuracy in dynamic prediction and changes in traffic flow. Distributionally robust optimization methods use ambiguity sets and coherent risk measures like Conditional Value at Risk to obtain performance bounds that are robust to shifts in the distribution (Fakoor et al., 2016). These methods directly naturally include the model mismatches and provide worst case performance outcomes and enhanced tail risk management. By including uncertainty quantification we can make trade-off between nominal performance and robustness. Distributionally robust methods mitigate the conservatism of bounded uncertainty sets while still being computationally tractable (Wei Jiang et al., 2024). However, strong optimization generally yields conservative solutions that favor the worst case protection, while sacrificing the average case performance. The quality of solutions is sensitive to the choice of uncertainty or ambiguity sets, which do not always capture actual operational situations. They require a more complex processing that demands a very high amount of computational power and thus they are not suitable for real time application. Moreover, conservative solutions potentially can be over-conservative, when uncertainty estimates are not accurate or when systems are functioning properly for long periods of time (Basil et al., 2025).
In recent work, the multi timescale nature of UAV satellite coordination is emphasised, as fast UAV dynamics and slower orbital mechanics imply having different planning horizons (Yang et al., 2024) Two-level optimization methods optimize temporal scales away from short term resource allocation (down) and long term trajectory planning (up), thus making it possible to decompose complicated coordination tasks effectively. Hierarchical structures break the entire mission task into several layers, such as high level mission planning, medium level task assignment, and low level trajectory implementation. Each layer acts on multiple time scales (Zhou and Zheng, 2013). Multi objective optimization paradigms handle trade off among conflicting objectives, such as short term power consumption and long term mission success. These techniques match algorithmic effort with the dynamic of the system, to decrease computation cost by maintaining solution quality (Toppen et al., 2017). The layered decomposition enables specialized algorithm design for multiple timescales and easy integration into legacy control systems. However, suboptimal solutions can be obtained with hierarchical strategies as a result of decomposition approximations. Flow of information loss between the layers may lead to the inconsistency in the decision making and compromise the overall performance Han et al. (2024). Coordinating over multiple time scales must also be mindful of what the interface should look like and possibly add another level of communication between machines.
Contemporary research increasingly focuses on hybrid approaches that combine classical optimization, metaheuristics, and machine learning (Li et al., 2024). Hybrid metaheuristic algorithms integrate multiple optimization techniques, such as merging ant colony optimization with local search or combining genetic algorithms with particle swarm optimization. Learning augmented optimization applies machine learning to guide traditional optimizers by predicting good initial solutions or dynamically adjusting parameters. Reinforcement learning frameworks incorporate classical control strategies to ensure safety while enabling adaptive behavior learning. These hybrid methods exploit the complementary strengths of different techniques, often achieving better performance than individual approaches (Liu et al., 2022). Classical components offer convergence guarantees and optimal solutions for well defined subproblems, while learning components enhance adaptability and handle uncertainties. The modular architecture supports testing, verification, and incremental deployment. However, hybrid systems increase complexity in design, implementation, and maintenance (Wang et al., 2022). Interactions between components can cause unexpected behaviors or performance degradation. Integration challenges also arise from differences in computational requirements, convergence speeds, and solution representations across hybrid modules.
The dynamic nature of UAV satellite environments requires real time adaptation, leading to advances in dynamic task scheduling and event triggered optimization. Dynamic priority mechanisms allow task importance to change based on environmental conditions, mission progress, or external events (Paek et al., 2019). Event triggered algorithms initiate optimization only when significant changes occur, reducing computational costs while maintaining responsiveness. Online learning methods continuously update models and policies using observed system behavior and performance feedback (He et al., 2024). Adaptive resource allocation techniques adjust to varying system conditions, load fluctuations, and constraint modifications. These dynamic strategies enable responsive adaptation while preserving computational efficiency (Wang et al., 2023). Event triggered mechanisms reduce communication and computation demands compared to continuous optimization. Online learning supports gradual performance improvements and adaptation to long term system changes. However, event triggered systems require careful selection of triggering conditions to maintain stability. Excessively rapid adaptation can cause oscillatory behavior or instability in noisy or transient environments (Zhou et al., 2022). Online learning methods may also face issues such as catastrophic forgetting and slow convergence under rapidly changing conditions.
Table 1 presents trade offs on UAV–LEO coordination mechanisms and the importance of balanced performance. Traditional optimization approaches, such as MILP and Hungarian algorithms, achieve strong optimality but are not adaptive, scalable, and computationally efficient in real time, which hinders the applications in dynamic environments. Classical ACO and PSO provide moderate results and miss the generality of the approach when the environment is dynamic since parameters are constant. Machine learning especially multi agent reinforcement learning exhibit high flexibility and robustness at the cost of high computational complexity, and weaker optimality guarantees. Distributed consensus and robustified optimization T Better scalability and uncertainty handling but worse reactivity and optimality. No balance is reached by hybrid metaheuristics andmulti timescale frameworks as these have high scalability and are NTN suitable, while at the same time they have adaptability and optimality constraints. Event-Led Scheduling is high achieving in real time but compromises predictive accuracy. The resulting adaptive colony optimization model integrates event triggered adaptability, robust cost shaping and connectivity aware heuristics, and reaches a trade-off between solution quality and computational complexity in dynamic UAV–LEO coordination.
3 Proposed work
The proposed AdCO framework overcomes the primary shortcomings of current UAV and LEO coordination schemes. It combines adaptation, robust optimization and multi-timescale coordination in one framework. Traditional optimization methods are often notoriously plagued by static parameterization and low real-time efficiency. On the other hand, the AdCO uses event-triggered parameters update and connectivity-based heuristics. Such techniques make fast responses to topology changes, orbital pass fluctuation and traffic non-stationarity with convergence guarantees. The framework addresses the slow adaptation and pheromone stagnation of fixed-parameter metaheuristics. It does so by dynamically modulating exploration and exploitation by means of feedback-generated learning coefficients and drift-sensitive pheromone partial resets. This design keeps all the time an exploration according to changing operating conditions. AdCO combats the sample inefficiency and black-box nature of RL methods with domain-specific heuristics inspired by orbital mechanics and communication theory. This has the benefit of making decisions more interpretable, and reduces the training burden. A bi-level scheduling structure is adopted to answer the problem of the scalability issues in consensus based methods. Fast UAV task sequence is decoupled with slow LEO relay coordination, making it very low communication overhead while still global coordination. The proposed framework also mitigates the conservatism of robust optimization with distributionally robust cost shaping. This method guards against the truly disastrous risks, but does not become too pessimistic. Last but not least, AdCO alleviates the drawbacks with event-triggered scheduling. This includes predictive visibility forecasting and proactive pheromone structuring ensuring that the framework may predict even connectivity windows and not only reacting to them. Algorithm 1 summarizes the overall procedure of the AdCO algorithm. Figure A1 illustrates the flowchart of the proposed model.
3.1 Simulation environment
A discrete event simulation environment written in Python is used in the evaluation framework. It uses accepted models and benchmark datasets to guarantee the repeatability and the reality of the UAV and LEO coordination algorithms assessment. The orbital mechanics layer is based on TLEs supplied by NORAD and CelesTrak. These TLEs are propagated using the SGP4 algorithm via the Python SGP4 library. This procedure creates time-tagged time visibility windows, elevation profiles and Doppler shift patterns for the specific region under consideration. The mobility of the UAVs is represented with 3D kinematic equations dedicated to energy-aware flight dynamics. The model combines physical realistic propulsion power demand curves, maximum velocity limits, and battery discharge laws grounded on commercial UAV specifications. The communications system is based on ITU R P.619 recommended elevation-dependent path loss models. It also includes log-normal shadowing with
Task distribution applies spatial point processes, such as clustered hotspot distributions and uniformly random placement. Arrival rates are tunable and are modeled after Poisson and ON-OFF bursty patterns. These task setups are based on realistic mission settings such as disaster response and agricultural monitoring. The network structure is changing as a dynamic graph by time-varying adjacency matrices. They capture UAV-LEO contact windows, which are intermittent, ground station visibility, and inter-UAV communication slots. The communication is limited due to range and line of sight. The energy model is for both the aircraft’s power for forward and turning flight as well as the computing power for onboard processing and communication power for the data packets traveling between ground and air. This allows for a more accurate assessment of trade-offs in energy efficiency. Performance statistics are calculated using statistical analysis on the NumPy and SciPy libraries. Bootstrap confidence intervals and Monte Carlo sampling are implemented over 50–100 independent random seeds. These approaches ensure the statistical significance and reproducibility of the results. Table 2 the basic simulation parameters and the assigned ranges are reported. It is defined in terms of the region size, the size of the swarm and the subset of the satellite constellation. Workload characteristics are determined by task arrival rates and deadline typologies. The communication environment is modeled in terms of height thresholds and shadowing factors. The limited energy resource of UAV is characterized by battery capacity ranges. Table 2 additionally, summarizes the number of trees used, the time in which each virtual experiment is executed and the number of Monte Carlo runs. These criteria provide a structured approach to systematically comparing algorithm performance across a range of operational conditions. Figure 2 illustrates the simulation environment.
4 Problem formulation
The problem formulation provides a structured platform where the mathematical depiction of the system entities, performance indices and the operational constraints helps in synthesizing the AdCO algorithm. Table 3 presents the notations used. Binary and continuous decision variables are adopted in the formulation to model task allocation and relay scheduling. The multi-objective function combines latency, energy and reliability degradation factors. A combination of linear and nonlinear constraints are required to meet capacity, visibility, and deadline constraints. All parts of the formulation, notation and decision variables, the objectives as well as the constraints are directly related to the corresponding AdCO elements so that pheromone update, heuristics evaluations and event-driven adaptations connect naturally to the mathematical model.
4.1 Latency decomposition
The end-to-end latency of task
4.1.1 Energy model
A composite energy model is developed to account for the main contributors to onboard energy consumption in UAV-assisted LEO relaying missions, including propulsion, computation, and communication. For each UAV
4.1.1.1 Propulsion energy
Let
4.1.1.2 Computation energy
Onboard processing energy is driven by executed CPU cycles for assigned tasks. Let
4.1.1.3 Communication energy
Communication energy considers the cost of uplink transmission from UAV to satellite, optional downlink reception overhead, and protocol signaling. Let
4.1.1.4 Total energy and feasibility
Combining the above contributions yields the per-UAV total energy as presented in Equation 14. Each UAV must satisfy the battery feasibility constraint by using Equation 15. Here
4.1.2 Reliability model
End-to-end reliability for task delivery in a UAV–LEO relaying scenario and presents a tractable penalty function for the purpose of optimization. Reliability represents the probability that all task-related sequence of data bits are transported from the executing UAV to a satellite (and then to the ground, if the model includes it) in the planning horizon, given stochastic link impairments. Reliability is directly related to the fractional relay decisions, the instant link qualities, and the temporal visibility in the model.
4.1.2.1 Per-link success probability
Consider the uplink between UAV
4.1.2.2 Task-level success probability with fractional relaying
Let
4.1.2.3 Negative log-reliability as an additive penalty
To incorporate reliability into a convex-amenable objective, it is convenient to work with the negative logarithm of the success probability by using Equation 18. The term
4.1.2.4 Visibility and feasibility
Reliability contributions are meaningful only when the link is visible and has nonzero capacity. Visibility is enforced by using Equation 19. Where,
4.1.2.5 Risk-sensitive shaping via CVaR
In highly variable environments, optimizing only the expected negative log-reliability may underweight rare but severe outages. A risk-sensitive shaping can be introduced using Conditional Value-at-Risk (CVaR) on the reliability penalty
4.1.2.6 Downlink reliability and end-to-end extension
If the satellite-to-ground segment is explicitly modeled, define an analogous error probability
5 Result, validation and discussion
The proposed AdCO framework is proposed in this work for dynamic coordination between UAVs and LEO satellites. The design goal of the framework is to trade-off latency, energy and reliability under constraints of computing, link visibility, link capacity and battery. The approach consists of four major steps (1) contact-aware task sequencing of UAVs based on pheromone-guided strategies with connectivity and deadline dependent heuristics (2) fractional relay scheduling during satellite passes under visibility and capacity constraints (3) event-triggered stabilization to address environmental or network changes (4) and risk-sensitive reliability shaping by means of Conditional Value at Risk (CVaR). Performance comparison To evaluate the performance, the proposed AdCO is compared with three baselines, the Greedy Proximity–Deadline (GPD) scheduler, the Static Heuristic ACO (SH-ACO) which does not consider the adaption to drift and the risk in the environment and the Link-Quality-Only (LQO) that only focuses on maximizing the throughput without taking into account the energy and the deadlines. These are compared to show the strengths of AdCO when it comes to better spatio-temporal decision-making than GPD, better adaptation to environment than SH-ACO and better trade-offs between latency, energy and reliability than LQO.
5.1 Contact-aware task sequencing
Contact-aware pheromone-guided sequencing minimised the task latency as a whole by favouring tasks that are in closer proximity in space and better in line with satellite overhead. It discouraged the use of low-energy-margin and negative-deadline-slack tasks, and thus encouraged to effectively make use of resources. The heuristic UAV trajectories in such windows tends to connect relay locations with relay-aligned path in which the data generation is synchronized with high-rate satellite pass. This resulted in less queuing, less retransmissions and less deadline misses. We have experimentally demonstrated that the latency distribution shifts distinctly toward the left, while more tasks meet deadlines without increasing energy consumption. These results in Figure 3 illustrates the importance of integrating spatial vicinity, predictive link opportunities and deadline slack to enable efficient and time-critical UAV–satellite cooperation. Figure 3 illustrates presents the latency vs. deadline-feasible rate under contact-aware task sequencing compared to a non-contact-aware baseline. Lower latency and higher feasible rate indicate superior performance.
Figure 3. Latency vs. deadline-feasible rate under contact-aware task sequencing compared to a non-contact-aware baseline.
5.2 Fractional relay scheduling over time-varying passes
Partial mapping of task data to multiple spatially visible satellites enhanced better utilization of the few pass capacities and alleviated bottlenecks observed in the one-link routing under the changing elevation and Doppler effects. The scheduler sent more traffic through high-rate and low-loss links, based on instantaneous visibility, and per-slot capacity limits enabling the scheduler to provide better throughput and shorter relay-induced latency, while not violating the energy constraints. Fractional scheduling achieved more b/slots and decreased the buffer overflow at the peak of contention, compared with the single-link, winner-take-all approach. This also lead to an improved deadline compliance and fewer retransmissions when link quality varied over a satellite pass. Figure 4 present the real-time throughput simulation over a satellite pass under fractional relay scheduling vs. single-link routing. Fractional scheduling better tracks time-varying capacity and visibility, leading to improved throughput during peak windows and more stable performance.
5.3 Event-triggered stabilization under regime shifts
The event-triggered stabilization procedure takes small perturbations induced by abrupt regime changes, like caused by visibility losses, sudden rate declines during low-elevation orbit passage, or load surges, and partially resets pheromone values to the former baseline. It avoids stagnation in search, overfitting to obsoleted paths, and deep deadline violations. Once a drift threshold is reached, the controller applies a soft reset to the exploration memory. It does this while still saving the useful historical patterns. The system can then quickly recover from infeasible scheduling. At the same time, transient latency peaks are reduced. The degree of missed deadlines also decreases, both during the disruption and after it. When compared to a continuously learning, non-event-driven learner, the event-triggered policy performs better. It reduces overshoot and oscillations. It also speeds up the return to pre-shift performance. Additionally, it lowers the variability of slot-level costs. This leads to more robust and stable coordination in a dynamic environment. Figure 5 illustrates the transient response to a regime shift (gray band) with and without event-triggered pheromone reset. The event-triggered policy reduces peak overshoot and settling time, stabilizing the per-slot objective more quickly. Expanded data points provide a finer resolution of the transient behavior.
Figure 5. Transient response to a regime shift (gray band) with and without event-triggered pheromone reset.
5.4 Risk-sensitive reliability shaping via CVaR
The addition of CVaR- optimal shaping of reliability caused the resource allocation to move away from links and time slots that were acceptably reliable on average but had a high-tailed loss performance during low-elevation passes or interference-rich epochs. By penalizing the tail of the negative log-success distribution at a desired confidence level, the scheduler proactively spread flows over more reliable contact opportunities and also postponed non-urgent transmissions from high-risk windows. It reduced the occurance of heavy packet loss without increasing mean latency and energy consumption considerably. Against an expectation-only reliability objective, the CVaR shaping decreased the probability of task failure in the worst decile and minimized variation in end-to-end success rates over simulation runs—representing a better protection against rare but high-cost outages. Figure 6 plots the tail reliability loss as a function of tail probability level, revealing that the CVaR-shaped allocation achieves great gain of robustness in the extreme tail level than the naive allocation based on the expectation only.
5.5 Overall mission effectiveness score
This sub-section defines an Overall Mission Effectiveness Score (OMES) to summarize performance. Four metrics are included: latency L (lower is better), deadline-feasible rate F (higher is better), energy per delivered bit E (lower is better), and end-to-end success rate R (higher is better). For each scenario s and metric m it normalizes to across all compared methods in that scenario. For lower-is-better metrics:
5.5.1 Metric normalization and composite score
For lower-is-better metrics:
For higher-is-better metrics:
This work uses
Default weights reflect mission priorities:
Sensitivity analysis varies each weight by
The OMES results provides a single view of effectiveness that combines latency, deadline adherence, energy per bit, and end-to-end success into one normalized score (Equations 23–25). Across all scenarios, AdCO attains the highest OMES, indicating balanced gains rather than an improvement on only one metric. In lighter loads, the score advantage reflects earlier findings on lower median latency and higher deadline satisfaction. Under heavier loads and contact variability, the advantage persists, which suggests that event triggered stabilization and fractional relay scheduling keep performance stable when contention increases. The Figure 7 shows that rank ordering does not change when weights are perturbed (Equation 26). AdCO’s score varies only slightly across the tested weight sets, which indicates robustness to different mission priorities. Baselines lose ground for different reasons. The greedy proximity deadline scheduler does not align execution with satellite windows, so its composite score drops as contention rises. The static heuristic ACO lacks drift handling, which hurts after regime shifts. The link quality only allocate focuses on instantaneous throughput and pays more in energy and tail reliability, which lowers its composite score even when mean rate is good.
5.6 Validation
The performance of AdCO is validated against three baseline strategies under identical workloads, contact patterns, and energy constraints: (i) Greedy Proximity–Deadline (GPD), which prioritizes tasks based on nearest location and earliest deadlines; (ii) Static Heuristic ACO (SH-ACO), which uses fixed pheromone and heuristic influences without drift resets or risk shaping; and (iii) Link-Quality-Only (LQO), which selects the best visible link for maximum instantaneous throughput without considering energy or deadlines. Across different load levels and satellite pass geometries, AdCO consistently reduced end-to-end latency and improved deadline adherence by aligning task sequencing with predicted link visibility and using fractional relay allocation, a feature missing in GPD. During both synthetic and real drift events, such as sudden pass loss or rate drops, AdCO’s event-triggered stabilization enabled faster recovery with lower overshoot compared to SH-ACO, demonstrating effective adaptive learning under regime shifts. Relative to LQO, AdCO achieved comparable or higher throughput while reducing tail reliability losses and lowering energy consumption per delivered bit through CVaR-driven flow diversification and energy-aware pacing. Overall, AdCO achieved superior multi-objective trade-offs, offering lower median and tail latency, fewer missed deadlines, reduced energy usage, and improved tail reliability, confirming the effectiveness of its design choices in spatiotemporal coupling, drift-aware adaptation, and risk-sensitive shaping.
5.6.1 Latency distribution (CDF)
Latency CDF show that AdCO can complete tasks in shorter time than all baseline strategies on the whole distribution rather on average. Its curve leans to the left and the slope is steeper, illustrating in Figure 8 that median latency decreases and the variance is lower. This performance improvement is due to contact-aware sequencing of tasks in terms of achieving the correlation of task execution with the future visibility of high rate satellite, and also in the avoidance of routes that result in a build-up of queues at the relay locations. In contrast, GPD considers spatial locality and deadlines but does not consider the timing of the pass, which results in larger relay delays. Although SH-ACO does not adjust its drift with the change of the environment which lead to its performance degrading over-time and LQO only considers the Instantaneous Link Quality (ILQ) and ignores the task timing and energy, which results in higher waiting time and transmission backoffs.
Figure 8. Measured latency CDFs under the medium-load scenario. The AdCO curve is left-shifted and steeper than baselines, indicating lower median and tail latency.
5.6.2 Deadline adherence across load
Figure 9 presents the performance, i.e., the percentage of deadlines that are met as a function of the offered load. For all the load levels, AdCO has a better feasible and it deteriorates more slowly even in heavy traffic. This is made possible by using fractional relay scheduling, that schedules flows over overlapping satellite passes over multiple days and allocates flows to “weaker” links without exceeding capacity. The stabilization using event-times even avoids the lasting performance decrease after regime changes. GPD, however, does not represent a feasible solution at the early subproblems due to the ignorance of visibility windows. SH-ACO operates well at the beginning and deteriorates at the presence of drift since it uses the same influence parameters, while LQO maintain throughput but it is not concerned with deadlines and its feasibility drops more quickly under conditions of high contention.
5.6.3 Energy reliability trade-off
Figure 10 shows the end-to-end success rate and the energy consumption per delivered bit. The performance of AdCO in this case remains always on the top-left part of the graph, where higher success rate at lower energies is reached. This gain is due to shaping based on CVaR, which helps to avoid risky low-elevation segments and spread load across more reliable contact opportunities, thereby reducing retransmissions and tail losses. Furthermore, energy-friendly scheduling also minimizes the redundant bursts of propulsion and load calculations, leading to even more efficiency gains. In contrast, GPD is likely to impose additional travel or waiting time that leads to higher energy consumption and lower reliability. SH-ACO maintains a good performance in a stable channel condition, but deteriorates when the channel changes; LQO can accommodate a high instantaneous throughput with a larger energy consumption per bit and an inferior tail reliability by concentrating traffic in the volatile time window.
5.7 Discussion
In all experimental settings, AdCO outperformed GPD, SH-ACO and LQO in terms of latency, deadline meeting ratio, tail reliability and energy usage. The enhancements were sustained even under loading demand and alterations of regime. In medium-load conditions, AdCO lowered measures as follows: the median end-to-end latency was reduced by an average of 20–
Table 4. Summary metrics at medium load (illustrative values; replace with measured numbers). Lower is better for latency and energy; higher is better for feasibility and success.
Table 5 schows that across the load sweep, AdCO degrades gracefully at high load (2.5 tasks/slot) it retains an 26-point advantage over GPD, 22 points over SH-ACO, and 24 over LQO. This reflects contact-aware sequencing and fractional relay allocation that sustain throughput without violating visibility and capacity. Tail-risk control and stabilization follow the trends shown in Table 6 and the regime-shift response. In AdCO, CVaR shaping reduces the worst-decile failure probability by about
6 Conclusion and future direction
The formulation of the UAV task scheduling and the fractional satellite relaying problem with dynamic visibility, capacity, and energy constraints is then proposed as a generic problem framework, and called as AdCO: adaptive colony optimization. The proposed AdCO adopts a unified framework for integrating three basic mechanisms, i.e., contact-aware sequencing for pass-aligned routing, event-triggered stabilization for rapid adaptation against regime change, and CVaR-based reliability shaping for lowering heavy-tailed losses. Evaluations over GPD, SH-ACO, and LQO demonstrate the gains in terms of smaller median and
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Author contributions
HM: Resources, Project administration, Validation, Visualization, Formal Analysis, Funding acquisition, Writing – review and editing, Methodology, Data curation, Supervision, Investigation, Writing – original draft, Conceptualization, Software.
Funding
The author(s) declare that no financial support was received for the research and/or publication of this article.
Conflict of interest
The author declares that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declared that they were an editorial board member of Frontiers, at the time of submission. This had no impact on the peer review process and the final decision.
Generative AI statement
The author(s) declare that no Generative AI was used in the creation of this manuscript.
Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Footnotes
1The simplified version of the proposed algorithm is available under Appendix Section.
References
Basil, N., Mohammed, A. F., Sabbar, B. M., Marhoon, H. M., Dessalegn, A. A., Alsharef, M., et al. (2025). Performance analysis of hybrid optimization approach for uav path planning control using fopid-tid controller and haoaroa algorithm. Sci. Rep. 15, 4840. doi:10.1038/s41598-025-86803-4
Bazrafkan, A., Igathinathane, C., Bandillo, N., and Flores, P. (2025). Optimizing integration techniques for uas and satellite image data in precision agriculture—a review. Front. Remote Sens. 6, 1622884. doi:10.3389/frsen.2025.1622884
Behjati, M., Alobaidy, H. A., Nordin, R., and Abdullah, N. F. (2025). Uav-assisted federated learning with hybrid lora p2p/lorawan for sustainable biosphere. Front. Commun. Netw. 6, 1529453. doi:10.3389/frcmn.2025.1529453
Bonyadi, M. R., Li, X., and Michalewicz, Z. (2014). A hybrid particle swarm with a time-adaptive topology for constrained optimization. Swarm Evol. Comput. 18, 22–37. doi:10.1016/j.swevo.2014.06.001
Chen, M., Yang, J., Zhou, J., Hao, Y., Zhang, J., and Youn, C.-H. (2018). 5g-smart diabetes: toward personalized diabetes diagnosis with healthcare big data clouds. IEEE Commun. Mag. 56, 16–23. doi:10.1109/mcom.2018.1700788
de Forges de Parny, L., Alibart, O., Debaud, J., Gressani, S., Lagarrigue, A., Martin, A., et al. (2023). Satellite-based quantum information networks: use cases, architecture, and roadmap. Commun. Phys. 6, 12. doi:10.1038/s42005-022-01123-7
Deng, X., Zeng, S., Chang, L., Wang, Y., Wu, X., Liang, J., et al. (2022). An ant colony optimization-based routing algorithm for load balancing in leo satellite networks. Wirel. Commun. Mob. Comput. 2022, 3032997. doi:10.1155/2022/3032997
Divband Soorati, M., Zahadat, P., Ghofrani, J., and Hamann, H. (2019). “Adaptive path formation in self-assembling robot swarms by tree-like vascular morphogenesis,” in Distributed autonomous robotic systems: the 14th international symposium (Springer), 299–311.
Fakoor, M., Bakhtiari, M., and Soleymani, M. (2016). Optimal design of the satellite constellation arrangement reconfiguration process. Adv. Space Res. 58, 372–386. doi:10.1016/j.asr.2016.04.031
Gu, C. H., Khatib, L. A., Fitzgerald, A. S., Graham-Wooten, J., Ittner, C. A., Sherrill-Mix, S., et al. (2023). Tracking gut microbiome and bloodstream infection in critically ill adults. Plos one 18, e0289923. doi:10.1371/journal.pone.0289923
Han, C., Xiong, W., and Yu, R. (2024). A hybrid forecasting model for self-similar traffic in leo mega-constellation networks. Aerospace 11, 191. doi:10.3390/aerospace11030191
Hasanain, W., Madoery, P. G., Yanikomeroglu, H., Kurt, G. K., Siddiqui, S., Martel, S., et al. (2025). Dynamic activation and assignment of sdn controllers in leo satellite constellations. arXiv Prepr. arXiv:2507. doi:10.48550/arXiv.2507.16774
He, J., Cheng, N., Yin, Z., Zhou, H., Zhou, C., Aldubaikhy, K., et al. (2024). Load-aware network resource orchestration in leo satellite network: a gat-based approach. IEEE Internet Things J. 11, 15969–15984. doi:10.1109/jiot.2024.3350072
Jubair, M. A., Mohamed Najeeb, S. M., Thanoon, K. H., Hilal Alzakwani, M. H., Abbas, F. H., and Ali, R. R. (2025). Multi-objective optimization in satellite-assisted uavs. J. Intelligent Syst. and Internet Things, 16.
Kahraman, I., Kose, A., Koca, M., and Anarim, E. (2023). Age of information in internet of things: a survey. IEEE internet things J 11, 9896–9914. doi:10.1109/jiot.2023.3324879
Kohani, S., and Zong, P. (2019). A genetic algorithm for designing triplet leo satellite constellation with three adjacent satellites. Int. J. Aeronautical Space Sci. 20, 537–552. doi:10.1007/s42405-019-00149-6
Li, C., Sun, X., and Zhang, Z. (2021a). Effective methods and performance analysis of a satellite network security mechanism based on blockchain technology. IEEE Access 9, 113558–113565. doi:10.1109/access.2021.3104875
Li, Y., Xie, J., Xia, M., Li, Q., Li, M., Guo, L., et al. (2021b). Dynamic resource pricing and allocation in multilayer satellite network. Comput. Mater. and Continua 69, 3619–3628. doi:10.32604/cmc.2021.016187
Li, Y., Gao, A., Li, H., and Li, L. (2024). “An improved whale optimization algorithm for uav swarm trajectory planning,” in Advances in Continuous and Discrete Models 2024, 40.
Li, H., Gong, P., Li, S., Wang, W., Liu, Y., Gao, X., et al. (2025a). Collaborative federated learning of unmanned aerial vehicles in space–air–ground integrated network. Space Sci. and Technol. 5, 0264. doi:10.34133/space.0264
Li, H., Xiao, M., Wang, K., Kim, D. I., and Debbah, M. (2025b). Large language model based multi-objective optimization for integrated sensing and communications in uav networks. IEEE Wirel. Commun. Lett. 14, 979–983. doi:10.1109/lwc.2025.3529082
Liu, W., Xiong, W., and Han, C. (2022). Communication satellite resource scheduling based on improved whale optimization algorithm. In MATEC Web Conf., Xiamen, China. 355, 03001. doi:10.1051/matecconf/202235503001
Liu, W., Jin, Y., Zhang, L., Gao, Z., and Tao, Y. (2024). Dynamic access task scheduling of leo constellation based on space-based distributed computing. J. Syst. Eng. Electron. 35, 842–854. doi:10.23919/jsee.2024.000071
Lozano-Cuadra, F., and Soret, B. (2024). “Multi-agent deep reinforcement learning for distributed satellite routing,” in 2024 IEEE international conference on machine learning for communication and networking (ICMLCN) (IEEE), 1–2.
Lozano-Cuadra, F., Soret, B., Leyva-Mayorga, I., and Popovski, P. (2025). Continual deep reinforcement learning for decentralized satellite routing. IEEE Trans. Commun., 1. doi:10.1109/tcomm.2025.3562522
Mai, X., Dong, N., Liu, S., and Chen, H. (2023). Uav path planning based on a dual-strategy ant colony optimization algorithm. Intell. and Robotics 3, 666–683. doi:10.20517/ir.2023.37
Maric, S., Baidar, R., Abbas, R., and Reisenfeld, S. (2025). System security framework for 5g advanced/6g iot integrated terrestrial network-non-terrestrial network (Tn-ntn) with ai-enabled cloud security. arXiv Prepr. arXiv:2508.05707.
Melillos, G., Themistocleous, K., Agapiou, A., Michaelides, S., Papadavid, G., and Hadjimitsis, D. G. (2019). “The use of field spectroscopy for the implementation of vegetation indices for the satellite remote sensing detection of underground military structures in Cyprus,” in IGARSS 2019-2019 IEEE international geoscience and remote sensing symposium (IEEE), 3570–3573.
Miller, A. (2025). “Micius, the world’s first quantum communication satellite, was hackable,” in 2025 international conference on quantum communications, networking, and computing (QCNC) (IEEE), 411–415.
Paek, S. W., Kim, S., and de Weck, O. (2019). Optimization of reconfigurable satellite constellations using simulated annealing and genetic algorithm. Sensors 19, 765. doi:10.3390/s19040765
Panaitopol, D., Jin, Y., Tang, R., and Park, C. (2023). Requirements on satellite access node and user equipment for non-terrestrial networks in 5g new radio of 3gpp release-17. Int. J. Satell. Commun. Netw. 41, 289–301. doi:10.1002/sat.1459
Peng, Q., Luo, Q., Ma, Y., Foh, C. H., Xiao, P., Elkashlan, M., et al. (2025). Dynamic resource allocation in distributed mimo-leo satellite networks. arXiv Prepr. arXiv:2505.20891.
Pham, Q.-V., Mirjalili, S., Kumar, N., Alazab, M., and Hwang, W.-J. (2020). Whale optimization algorithm with applications to resource allocation in wireless networks. IEEE Trans. Veh. Technol. 69, 4285–4297. doi:10.1109/tvt.2020.2973294
Pirandola, S. (2021). Satellite quantum communications: fundamental bounds and practical security. Phys. Rev. Res. 3, 023130. doi:10.1103/physrevresearch.3.023130
Soret, B., Leyva-Mayorga, I., Mercado-Martínez, A. M., Moretti, M., Jurado-Navas, A., Martinez-Gost, M., et al. (2024). Semantic and goal-oriented edge computing for satellite earth observation. arXiv preprint arXiv:2408.15639
Stodola, P., Nohel, J., and Horák, L. (2025). Dynamic reconnaissance operations with uav swarms: adapting to environmental changes. Sci. Rep. 15, 15092. doi:10.1038/s41598-025-00201-4
Toppen, W., Johansen, D., Sareh, S., Fernandez, J., Satou, N., Patel, K. D., et al. (2017). Improved costs and outcomes with conscious sedation vs general anesthesia in tavr patients: time to wake up? PLoS One 12, e0173777. doi:10.1371/journal.pone.0173777
Wang, Y., Su, Z., Ni, J., Zhang, N., and Shen, X. (2021). Blockchain-empowered space-air-ground integrated networks: opportunities, challenges, and solutions. IEEE Commun. Surv. and Tutorials 24, 160–209. doi:10.1109/comst.2021.3131711
Wang, Z., Ding, H., Yang, J., Hou, P., Dhiman, G., Wang, J., et al. (2022). Orthogonal pinhole-imaging-based learning salp swarm algorithm with self-adaptive structure for global optimization. Front. Bioeng. Biotechnol. 10, 1018895. doi:10.3389/fbioe.2022.1018895
Wang, J., Ouyang, H., Zhang, C., Li, S., and Xiang, J. (2023). A novel intelligent global harmony search algorithm based on improved search stability strategy. Sci. Rep. 13, 7705. doi:10.1038/s41598-023-34736-1
Wang, X., Feng, Y., Tang, J., Dai, Z., and Zhao, W. (2024). A uav path planning method based on the framework of multi-objective jellyfish search algorithm. Sci. Rep. 14, 28058. doi:10.1038/s41598-024-79323-0
wei Jiang, W., Han, H., Zhang, Y., and Mu, J. (2024). “Multi-controller placement in software defined satellite networks: a meta-heuristic approach,” in 2024 IEEE 99th vehicular technology conference (VTC2024-Spring) (IEEE), 1–7.
Yang, Y., Wu, X., Li, J., Zang, J., Lu, J., and Zgeib, R. (2024). Configuration design method of mega constellation for low earth orbit observation. Space Sci. and Technol. 4, 0175. doi:10.34133/space.0175
Yosri, A. M., Azam, A., Alanazi, F., Alshehri, A. H., and Okail, M. A. (2024). Shear strength and particle breakage of construction and demolition waste as a function of moisture state and compaction level: insights for sustainable highway engineering. Plos one 19, e0298765. doi:10.1371/journal.pone.0298765
Zengshan, Y., Changhao, W., Chongbin, G., Yuanchun, L., Mengwei, X., Weiwei, G., et al. (2025). A comprehensive survey of orbital edge computing: systems, applications, and algorithms. Chin. J. Aeronautics 38, 103316. doi:10.1016/j.cja.2024.11.026
Zhang, H., Cheng, D., Kou, Q., Asad, M., and Jiang, H. (2024). Indicative vision transformer for end-to-end zero-shot sketch-based image retrieval. Adv. Eng. Inf. 60, 102398. doi:10.1016/j.aei.2024.102398
Zhang, Z., Dong, T., Yin, J., Xu, Y., Luo, Z., Jiang, H., et al. (2025). A particle swarm optimization-based queue scheduling and optimization mechanism for large-scale low-earth-orbit satellite communication networks. Sensors 25, 1069. doi:10.3390/s25041069
Zhao, J., Cai, R., and Sun, W. (2021). Regional sea level changes prediction integrated with singular spectrum analysis and long-short-term memory network. Adv. Space Res. 68, 4534–4543. doi:10.1016/j.asr.2021.08.017
Zhou, Y., and Zheng, H. (2013). A novel complex valued cuckoo search algorithm. Sci. World J. 2013, 597803. doi:10.1155/2013/597803
Zhou, J., Li, B., and Meng, Q. (2022). “Optimized design method for satellite constellation configuration based on real-time coverage area evaluation,” in 2022 29th international conference on geoinformatics (IEEE), 1–5.
Zhu, M., Xu, X., Zhang, Y., Zhou, Y., Du, P., Xu, D., et al. (2023). “Dynamic resource allocation for network slicing in leo satellite networks,” in International conference on communications and networking in China (Springer), 112–125.
Zhu, S., Huang, Y., Li, L., Wei, X., and Liu, B. (2024). Research on laser induced plasma ignition of gas oxygen/methane. Acta Astronaut. 217, 208–220. doi:10.1016/j.actaastro.2024.01.015
Appendix
FIGURE A1. Simplified version of Algorithm 1.
Keywords: non-terrestrial IoT, UAV swarms, LEO satellites, adaptive colony optimization, dynamic task allocation, scheduling, edge computing, connectivity-aware orchestration
Citation: Mohapatra H (2025) Adaptive ant colony methods for UAV LEO coordination in non terrestrial IoT. Front. Commun. Netw. 6:1691346. doi: 10.3389/frcmn.2025.1691346
Received: 23 August 2025; Accepted: 06 October 2025;
Published: 29 October 2025.
Edited by:
Athirah Mohd Ramly, University of East London, United KingdomReviewed by:
Ahmad Anwar Zainuddin, International Islamic University Malaysia, MalaysiaKhairayu Badron, International Islamic University Malaysia, Malaysia
Copyright © 2025 Mohapatra. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Hitesh Mohapatra, aGl0ZXNobWFoYXBhdHJhQGdtYWlsLmNvbQ==