Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. For. Glob. Change, 01 December 2025

Sec. Fire and Forests

Volume 8 - 2025 | https://doi.org/10.3389/ffgc.2025.1705341

This article is part of the Research TopicClimate Change, Forest Fire Risks, and Adaptation Strategies for Sustainable Ecosystem ManagementView all 4 articles

Advancing wildfire susceptibility mapping through maching learning and SHapley Additive exPlanations-integrated geospatial analysis in Northern Morocco’s Mediterranean region

  • 1Department of Geography, Faculty of Humanities and Social Sciences, Ibn Tofail University, Kenitra, Morocco
  • 2Faculty of Sciences, University Ibn Tofail, Kenitra, Morocco
  • 3Department of Computer Science, Faculty of Sciences, University Ibn Tofail, Kenitra, Morocco
  • 4Department of Geography and Environmental Sustainability, College of Humanities and Social Sciences, Princess Nourah bint Abdulrahman University, P.O. Box 84428, Riyadh, Saudi Arabia
  • 5Institute of Environmental Engineering, RUDN University, Moscow, Russia
  • 6Geological and Geophysical Engineering Department, Faculty of Petroleum and Mining Engineering, Suez University, Suez, Egypt

Wildfires pose a major environmental threat to Mediterranean ecosystems, intensified by climate change and growing human pressures. Yet, limited research has combined machine learning (ML) and SHapley Additive exPlanations (SHAP) to jointly assess predictive accuracy and interpret wildfire-driving mechanisms, particularly in data-scarce regions such as Northern Morocco’s Tangier–Tétouan–Al Hoceima (TTA) area—a recognized wildfire hotspot requiring advanced predictive tools for effective risk mitigation. This study applied a multi-model ML framework to map wildfire susceptibility by integrating environmental, climatic, and topographic variables with historical fire records. Remote sensing indices (NDVI, LST, wind speed) from summer 2022 were combined with topographic parameters (elevation, slope, aspect, TWI) and proximity measures (distance to roads, settlements, streams) derived from regional datasets. Five ML algorithms—CART, k-NN, SVM, LightGBM, and XGBoost—were tested, with SHAP was employed to interpret model behavior. Among these, XGBoost achieved the highest performance (accuracy = 0.920; F1-fire = 0.926; F1-nonfire = 0.912), followed by LightGBM (accuracy = 0.905; AUC = 0.965), confirming the superiority of gradient boosting techniques over conventional models. SHAP analysis identified NDVI as the most influential predictor, underscoring vegetation density as the primary driver of fire susceptibility through its contribution to fuel load. Secondary predictors varied: LightGBM emphasized elevation and wind speed, whereas XGBoost highlighted LST and wind speed. Interaction effects revealed that concurrent high temperatures and strong winds during Chergui events, as well as interactions between vegetation density and terrain position, substantially increase fire likelihood. Overall, wildfire susceptibility in Mediterranean landscapes arises from complex, non-linear interactions among vegetation, topography, and meteorological extremes. The resulting susceptibility maps deliver actionable insights for targeted fire prevention, resource allocation, and early warning, providing a robust framework to enhance adaptive wildfire management in Morocco’s most vulnerable ecosystems.

1 Introduction

Wildfires represent a defining ecological process in Mediterranean ecosystems (Salis et al., 2023; Davis et al., 2025), shaped by hot, dry summers and extensive human land-use transformations. Recent years have witnessed an alarming escalation in wildfire activity, with the summer of 2022 emerging as a particularly catastrophic benchmark across the Mediterranean basin. Spain experienced its warmest summer in 700 years (Serrano-Notivoli et al., 2023), while France confronted unprecedented fires that revealed critical shifts in fire patterns. A detailed assessment of the 2022 French fire season documented 42,520 hectares burned, with particularly severe impacts on previously less vulnerable forest types (Vallet et al., 2023). Several other Mediterranean countries, including Portugal and Italy, reported their most severe fire seasons in decades (Couto et al., 2022; Rodrigues et al., 2023). This regional crisis has continued to intensify, with 2025 alone surpassing 1 million hectares burned across Europe and the Mediterranean, a new record was established even before the peak fire season concluded (Niranjan, 2025). Underscoring the regional scale crisis and highlighting the urgent need for improved predictive modeling and adaptive fire management strategies across the Mediterranean region.

Similarly, Northern Morocco’s Tangier-Tétouan-Al Hoceima (TTA) region faces escalating wildfire challenges, where fires increasingly endanger forest resources, disrupt communities, and threaten economic assets (Boubekraoui et al., 2023). Recent severe outbreaks include a major fire in Chefchaouen forest that burned over 500 hectares in August 2025 (Toutate, 2025), prompting extreme risk alerts and highlighting the region’s growing vulnerability to intense wildfire events.

Assessing wildfire susceptibility is vital for effective mitigation; however, traditional geospatial methods that depend on expert-assigned weighting often struggle to capture the nonlinear relationships among environmental, climatic, and anthropogenic factors (Iban and Aksu, 2024). Machine learning (ML) provides a data-driven alternative capable of modeling such complex interactions to enhance early warning systems and inform policy decisions (Abdollahi and Pradhan, 2023; Ejaz and Choudhury, 2025; Zhao et al., 2025). Various ML algorithms—including Classification and Regression Trees (CART), Random Forest (RF), Support Vector Machine (SVM), k-Nearest Neighbors (k-NN), XGBoost, and LightGBM—have demonstrated reasonable predictive performance in wildfire modeling (Abujayyab et al., 2022; Chen et al., 2024; Sheriff et al., 2025).

To address transparency and interpretability challenges inherent in ML models, explainable artificial intelligence (XAI) approaches—particularly the SHAP framework—have been increasingly applied in Mediterranean and global wildfire studies (Iban and Aksu, 2024; Lee et al., 2025; Liao et al., 2025). Together, these advancements highlight the potential of combining ML predictive power with SHAP-based interpretability to deepen the understanding of wildfire dynamics in Mediterranean ecosystems, where fragile landscapes, human activities, and climatic extremes intersect to amplify fire risk.

Accordingly, this study develops an interpretable wildfire susceptibility framework for Northern Morocco’s TTA region by: (1) evaluating five ML algorithms (CART, SVM, k-NN, XGBoost, and LightGBM) for predictive accuracy; (2) applying SHAP analysis to identify dominant drivers and their interactions; and (3) producing spatially explicit risk maps to guide prevention efforts and resource allocation. The integrated ML-XAI framework links advanced analytical capabilities with practical wildfire management needs in this highly vulnerable Mediterranean setting.

2 Materials and methods

2.1 Study area and recent wildfire trends in Northern Morocco

The TTA region, located in northwestern Morocco, is shown in Figure 1. The region is predominantly mountainous, as it encompasses a large portion of the Rif range, characterized by steep slopes, rugged terrain, and narrow valleys. These geomorphological conditions are combined with dense forest cover.

Figure 1
Map of northern Morocco showing fire and no-fire points marked in yellow and blue, respectively. Major cities like Tanger, Tétouan, and Chefchaouen are labeled. Elevation is indicated by a color gradient from light to dark red, with a scale showing high and low points. Inset map shows the study area location within Morocco.

Figure 1. Location map of the TTA region, Northern Morocco.

Climatically, the region is under strong Mediterranean influence, with contrasting sub-humid to humid conditions in coastal and mid-altitude areas, and dry conditions in inland valleys. Elevations range from sea level to more than 2,400 meters. The region also has a 447 km coastline along both the Mediterranean and Atlantic, which moderates temperatures but does not prevent recurrent summer droughts and heatwaves.

From a demographic perspective, the region has a population of more than 4 million inhabitants (RGPH, 2024), with an average density of about 233 inhabitants/km2, making it one of the most densely populated regions in Morocco. This demographic pressure, particularly urban expansion and rural land-use change, intensifies pressures on natural resources.

The TTA region has consistently recorded Morocco’s highest wildfire frequency and burned areas in recent years (Badda et al., 2023; Boubekraoui et al., 2023). In 2022, Morocco experienced approximately 500 wildfires that burned 22,762 hectares nationwide. The TTA region accounted for a disproportionate share of this damage, representing 37.6% of all fire incidents (188 of 500 fires) and 82.2% of the total burned area (18,704 of 22,762 hectares) (L’Economiste, 2023). This concentration of fire activity underscores the region’s exceptional vulnerability within Morocco’s wildfire landscape. The pattern persisted through subsequent years, with the TTA region maintaining its status as the country’s most fire-affected area, culminating in the August 2025 Chefchaouen fires that consumed over 500 hectares under dry Chergui wind conditions (Figure 2).

Figure 2
Three satellite images of a landscape taken on different dates: August 8, 2025, August 15, 2025, and August 27, 2025. The images show progressive changes in vegetation with increasing red areas, indicating potential environmental changes or impacts over time.

Figure 2. August 2025 Sentinel-2 Short-Wave Infrared (SWIR) Time-Series of Wildfires in Chefchaouen Province, Morocco. This figure shows false-color satellite imagery capturing two different wildfire events in August 2025. On 08 August 2025, the scene appears unaffected, with healthy vegetation highlighted in green. By 15 August 2025, the first fire is clearly visible as a large, burned scar in red tones on the eastern side of the image. A second fire event is detected by 27 August 2025, with new burned areas appearing in the northern part of the scene, while the earlier scar in the east remains evident. Together, these images illustrate the occurrence of two distinct wildfires within the same region over a short time span, emphasizing both the intensity and recurrence of fire activity.

2.2 Conditioning factors

In this study, a combination of environmental, topographic, climatic, and anthropogenic conditioning factors was integrated (Tables 1; Figure 3), as illustrated in the methodological framework (Figure 4). These factors, including aspect, elevation, slope, topographic wetness index (TWI), land surface temperature (LST), wind speed, vegetation indices (NDVI), and proximity indicators (roads, settlements, and water bodies), were selected due to their established influence on fire ignition, spread, and severity (Michael et al., 2021; Singha et al., 2024; Ahmad et al., 2025). Table 2 summarizes the conditioning factors, their effects on wildfire behavior, and the supporting literature.

Table 1
www.frontiersin.org

Table 1. Data utilized in the development of the wildfire susceptibility model.

Figure 3
Six-panel map of a coastal region bordered by the Atlantic Ocean and Mediterranean Sea. Panel descriptions: Top left shows TWI with a gradient from low (6.25) to high (20.7) in shades of blue. Top right displays slope with a gradient from low (0) to high (75.85) in green to red. Middle left illustrates aspect with multicolor direction indicators. Middle right highlights distance to settlements with gradients of green; highest distance is 29,084 meters. Bottom left shows distance to streams in blue; highest is 15,908.4 meters. Bottom right maps distance to roads in purple; highest is 6,932.71 meters. Four maps depict different environmental characteristics of northern Morocco. The first map shows NDVI with green indicating high values and red low. The second map outlines wind speed, ranging from blue for low to red for high speeds. The third map reveals land surface temperature with blue for cooler areas and red for warmer. The fourth map presents elevation, using light to dark brown, representing low to high altitudes. Cities such as Tanger, Al-Hoceima, and Ouezzane are marked.

Figure 3. Conditioning variables influencing wildfire susceptibility.

Figure 4
Flowchart depicting a machine learning process for fire occurrence prediction. Conditioning factors include NDVI, TWI, aspect, LST, and more. The training dataset contains fire occurrence data and no-fire areas, followed by descriptive statistics and multicollinearity test. The dataset is split 70-30 for training and validation. Machine learning models used are CART, XGBoost, LightGBM, SVM, and k-NN. Outputs are validated and explained using SHapley Additive exPlanations.

Figure 4. Workflow of wildfire susceptibility modeling using machine learning and SHAP analysis.

Table 2
www.frontiersin.org

Table 2. Conditioning factors used in this study and their influence on wildfire behavior.

Dynamic variables such as NDVI, LST, and wind speed were derived from remote sensing products corresponding specifically to the summer of 2022, ensuring that all input layers reflect the same environmental and climatic conditions prevailing during that fire season. This temporal alignment between conditioning factors and fire occurrences was essential to preserve internal consistency. Using datasets from multiple years could introduce temporal mismatches, as post-fire vegetation loss or regeneration significantly alters NDVI and surface temperature values in subsequent years. Thus, using static 2022 layers allowed the model to capture realistic pre-fire conditions rather than post-disturbance states.

Spatial analysis layers were systematically generated using GIS-based methods (Table 3). In contrast, static variables such as topography (slope, aspect, and elevation), TWI, and proximity factors (roads, streams, and settlements) were derived from national datasets that remain stable over time. Population density was also included as a socio-demographic driver, using data from the most recent national census (2014), since updated spatial population data were unavailable.

Table 3
www.frontiersin.org

Table 3. GIS-based techniques are applied for generating spatial layers.

2.3 Machine learning in wildfire risk mapping

Machine learning has become a cornerstone in wildfire risk assessment, offering robust frameworks to model complex, nonlinear relationships between environmental factors and fire occurrence. Among the spectrum of ML approaches, methods such as Classification and Regression Trees (CART), Random Forest (RF), Support Vector Machine (SVM), k-Nearest Neighbors (k-NN), XGBoost, and LightGBM stand out for their proven effectiveness (Table 4).

Table 4
www.frontiersin.org

Table 4. Overview of machine learning models applied in wildfire susceptibility studies.

2.4 Construction of the training dataset

The construction of the training dataset followed a multi-step approach to ensure accuracy and representativeness. Wildfire occurrence records were obtained from the MODIS Fire Information for Resource Management System (FIRMS), which provides daily global fire detections based on thermal anomalies (Islam et al., 2017). To ensure temporal coherence with the dynamic conditioning factors, only fire points from 2022 were retained for model training and validation. This approach guarantees that the spectral and climatic characteristics at each fire location correspond to the actual environmental conditions at the time of the event. Including fire records from different years would have led to inconsistencies between fire locations and the static 2022 predictor maps, since post-fire changes can significantly alter vegetation and surface properties.

Following data compilation, extensive cleaning and verification were performed, including the removal of duplicates, exclusion of points outside the study area, and correction of spatial overlaps. Non-fire points were randomly generated across the study area while ensuring they did not overlap with recorded fires or buffer zones around them. To maintain class balance, an equal number of fire and non-fire points was retained.

The final dataset consisted of 228 fire points and 228 non-fire points, providing a balanced and representative sample of burned versus unburned areas. This dataset formed the foundation for training and validating the machine learning models, ensuring reliable and unbiased evaluation of wildfire susceptibility patterns.

2.5 Multicollinearity assessment

To guarantee the robustness of the analysis, it was necessary to examine the potential presence of multicollinearity among the explanatory variables. Multicollinearity arises when predictors are strongly interrelated, which can lead to redundancy and distort model outputs. A widely recognized approach for diagnosing this issue is the Variance Inflation Factor (VIF) (Moumane et al., 2025).

2.6 Performance metrics

To ensure a robust assessment of the machine learning models developed for wildfire susceptibility mapping, the dataset was divided into two subsets: 70% for training and 30% for testing. This partitioning allowed the algorithms to learn from a representative portion of the data while preserving an independent sample for unbiased validation (Moumane et al., 2025).

Model performance was evaluated using several complementary statistical indicators that collectively measure classification accuracy and predictive reliability. The confusion matrix provided a comprehensive summary of classification outcomes by quantifying true positives (TP), false positives (FP), true negatives (TN), and false negatives (FN). Beyond identifying misclassification patterns and class-specific biases, the confusion matrix was used to compute the overall accuracy of each model. Additionally, it served as the basis for calculating the F1-Score, which represents the harmonic mean of precision and recall, offering a balanced measure of model performance, particularly when handling datasets with class imbalance (Equations 14).

Overall accuracy(%)=number of correctly classified samplestotal number of samples×100    (1)
F1=2×Precision×RecallPrecision+Recall    (2)
Precision=True PositiveTrue Positive+False Positive    (3)
Recall=True PositiveTrue Positive+False Negative    (4)

The Receiver Operating Characteristic (ROC) curve and its corresponding Area Under the Curve (AUC) were used to assess the models’ ability to discriminate between areas with high and low wildfire susceptibility. A higher AUC value indicates stronger model performance in distinguishing susceptible zones from non-susceptible ones.

2.7 Predictive susceptibility mapping

To create continuous fire susceptibility maps, the study area raster was first converted into a dense point dataset within ArcGIS 10.8. For each point, the values of the 10 conditioning factors were extracted and, following a data cleaning process to remove entries with missing values, the dataset was exported to Google Colab. There, the trained machine learning models were applied to predict a susceptibility value for each point, resulting in a value between 0 (non-fire point) and 1 (fire point) alongside its geographic coordinates. These predicted points were then re-imported into ArcGIS, where the Kriging interpolation method was employed to transform the discrete point data into a seamless spatial surface. Finally, this continuous output was classified into four distinct susceptibility zones: Low (0–0.25), Medium (0.25–0.5), High (0.5–0.75), and Very High (0.75–1).

2.8 SHAP Explainability analysis

To complement the machine learning models, we employed the SHAP framework to interpret the contribution of conditioning factors to wildfire susceptibility. SHAP is grounded in cooperative game theory and provides a unified measure of feature importance by assigning each predictor a marginal contribution to the model’s output. Unlike traditional importance metrics that only capture global relevance, SHAP values enable both global interpretation (ranking variables by their average influence across the study area) and local interpretation (examining how each factor drives individual predictions).

2.9 Computational environment

All machine learning modeling and computational experiments were conducted within the Google Colaboratory cloud-based environment. The analyses were performed using Python 3.10.12, leveraging a suite of standard data science and machine learning libraries, including `scikit-learn` for CART, SVM, and k-NN implementations, `XGBoost`, and `LightGBM`. To ensure the complete reproducibility of all results, a fixed random seed of `42` was implemented globally. This seed was consistently applied to the train-test data split and to the initialization of all stochastic algorithms.

All models were configured with their default hyperparameters to provide a baseline for comparison, with the exception of the random seed specification.

SHAP analysis was performed. The analysis was conducted using the `shap` Python library (version 0.44.0) on the same feature dataset used for model training. The SHAP analysis was applied to the LightGBM and XGBoost models, as these are natively supported by SHAP’s TreeExplainer for exact computation of Shapley values.

For each model, the analysis generated:

• SHAP Summary Plots (Beeswarm plots): Visualizing the distribution of the impact each feature has on the model output for every individual prediction in a sampled dataset.

• SHAP Feature Importance Plots: Ranking the features based on their mean absolute SHAP values, representing their overall importance in the model’s decision-making process.

• SHAP Interaction Plots: Where computationally feasible, interaction values were calculated to reveal how feature pairs jointly impact the predictions.

3 Results

3.1 Descriptive statistics of features

Based on the comprehensive comparative analysis, the environmental factors distinguishing fire-prone areas from non-fire zones reveal a compelling pattern of fire susceptibility driven by specific topographic, vegetative, and anthropogenic conditions (Figures 5, 6; Tables 5, 6). The data confirm that fire occurrences are strongly associated with significantly higher elevation terrain, evidenced by a striking 115% increase in mean elevation at fire sites, moving from approximately 302 meters to 649 meters, indicating that fires predominantly affect higher altitude landscapes with distinct microclimates and vegetation types. This elevation effect is complemented by substantially steeper slopes in fire zones, showing a 69% increase in mean slope values (Table 7), which facilitates rapid drainage and enhances fire spread dynamics through improved air circulation and fuel drying. The vegetative profile further distinguishes fire-prone areas with a remarkable 93% increase in NDVI values, confirming that denser, healthier vegetation serves as the fundamental fuel source necessary for fire ignition and propagation. Spatial analysis reveals critical patterns of remoteness, with fire locations positioned considerably farther from human influence—showing a 118% increase in distance to settlements and 64% increase in distance to roads—highlighting how reduced human presence and limited fire response capability heighten vulnerability. Additionally, the hydrological context shows fire areas are 49% farther from streams, indicating moisture-deficient environments where reduced ambient humidity and limited water access create favorable fire conditions. The topographic wetness index further supports this aridity pattern with a 15% decrease in TWI values at fire sites, confirming that drier landscapes are more susceptible. While temperature and aspect show minimal differentiation, the consistency across all major environmental factors paints a clear portrait of fire-vulnerable landscapes as remote, high-elevation, densely vegetated territories where topographic steepness, vegetative abundance, and limited human intervention converge to create ideal conditions for fire occurrence, providing crucial validation for targeted fire susceptibility modeling and prevention strategies.

Figure 5
Nine histograms showing the distribution of various geographic and environmental variables. Each graph has frequency on the y-axis. Variables include NDVI, TWI, LST, Aspect, Elevation, Slope, WindSpeed, Dis_Stream, Dis_Settle, and Dis_Road. Mean and median values are marked consistently, indicated by dashed lines. All histograms have a red overlay, with data ranges detailed on the x-axes.

Figure 5. Distribution of environmental factors in wildfire training dataset (fire points).

Figure 6
Histogram plots showing distributions of various environmental and geographical metrics: NDVI, TWI, LST, Aspect, Elevation, Slope, WindSpeed, Dis_Stream, Dis_Settle, and Dis_Road. Each histogram includes mean and median values, indicated by vertical lines, representing data spread and central tendencies.

Figure 6. Distribution of environmental factors in wildfire training dataset (Non-fire points).

Table 5
www.frontiersin.org

Table 5. Descriptive statistics of environmental and topographic features in fire areas.

Table 6
www.frontiersin.org

Table 6. Descriptive statistics of environmental and topographic features in non-fire areas.

Table 7
www.frontiersin.org

Table 7. Changes of features between fire and non-fire areas.

3.2 Multicollinearity diagnosis

The multicollinearity assessment presented in Figure 7 combines two complementary analyses to evaluate redundancy among the 10 conditioning factors used for forest fire susceptibility modeling.

Figure 7
Panel a shows a correlation matrix with variable pairs colored according to their correlation values, ranging from blue for negative correlations to red for positive ones. Panel b displays a bar chart of the Variance Inflation Factor (VIF) and tolerance for various variables, with VIF values under the threshold of ten, indicating low multicollinearity.

Figure 7. Multicollinearity assessment of conditioning factors used for forest fire susceptibility modeling: (a) Pearson correlation matrix showing pairwise relationships between environmental and anthropogenic variables; (b) Variance inflation factor.

The Pearson correlation matrix (Figure 7a) reveals that most pairwise correlation coefficients range from weak to moderate (|r| < 0.6), suggesting generally low linear dependency between variables. The strongest correlation is observed between distance to settlements and distance to roads (r = 0.69), indicating significant spatial overlap in anthropogenic infrastructure patterns. A notable negative correlation exists between LST and wind speed (r = −0.58), reflecting the inverse relationship between temperature and atmospheric dynamics in the study area. Moderate positive correlations are evident between elevation and distance to streams (r = 0.41), consistent with expected topographic-hydrological relationships, and between NDVI and slope (r = 0.34), suggesting vegetation preference for steeper terrain. Other correlations, such as between aspect and remaining variables, remain weak (|r| < 0.3), indicating minimal directional influence on other environmental factors.

The Variance Inflation Factor (VIF) and tolerance analysis (Figure 7b) confirm the absence of problematic multicollinearity across all predictors. VIF values for all variables remain well below the conservative threshold of 10, with most factors demonstrating VIF scores below 3. This indicates that each conditioning factor contributes unique explanatory power to the susceptibility model without significant redundancy, ensuring stable parameter estimates and reliable model interpretation for forest fire prediction.

3.3 Comparative analysis of fire susceptibility areas across models

The comparative analysis of predicted fire susceptibility areas highlights clear spatial and quantitative differences among the five machine learning models (Figure 8). In the low-susceptibility class, LightGBM (65.1%) and XGBoost (64.7%) allocate the largest areas, indicating a classification approach that favors lower-risk identification across the landscape. In contrast, SVM (57.7%), k-NN (57.7%), and CART (57.3%) assign smaller extents to this class.

Figure 8
Stacked bar chart comparing risk level proportions across different algorithms: CART, LightGBM, SVM, XGBoost, and k-NN, each on an area of 16167 square kilometers. Risk levels are shown as low, medium, high, and very high. The proportions vary, with low risk generally having the highest percentage across all algorithms, particularly 65.1% for LightGBM and 64.7% for XGBoost. High and very high risks are higher in SVM and k-NN algorithms, respectively.

Figure 8. Area distribution of forest fire susceptibility classes predicted by machine learning algorithms.

For the medium-susceptibility class, CART records the highest estimate (8.8%), followed by k-NN (6.7%), while SVM predicts the lowest (4%).

Conversely, the very high-susceptibility class exhibits an opposite trend: SVM predicts a considerably large area (26.8%), more than double that of LightGBM (11.4%), with k-NN (17%) also showing a strong inclination toward extreme-risk mapping, while CART (10.6%) remains more restrained.

Ensemble-based models (LightGBM, XGBoost) tend to cluster their predictions toward low and medium hazard levels, while SVM and k-NN emphasize extreme-risk zones. CART occupies an intermediate position (Figure 9).

Figure 9
Five risk level maps of Libya show variations in risk categories: low, medium, high, and very high. Each map uses a different model: CART, LightGBM, SVM, XGBoost, and k-NN. Regions are shaded in colors representing risk levels, with notable differences in distribution patterns among models.

Figure 9. Spatial distribution of forest fire susceptibility across Northern Morocco as predicted by machine learning models.

3.4 Validation

The comparative assessment of five machine learning algorithms revealed clear differences in their predictive performance for wildfire susceptibility mapping. The CART model achieved moderate results (accuracy = 0.891, AUC = 0.889), whereas k-NN (accuracy = 0.766) and SVM (accuracy = 0.774) demonstrated noticeably weaker predictive capabilities (Figures 10, 11). In contrast, XGBoost yielded the most robust performance, attaining the highest accuracy (0.920) and excellent F1-scores for both fire (0.926) and non-fire (0.912) categories (Figures 12, 13), followed closely by LightGBM (accuracy = 0.905, AUC = 0.965). Both ensemble-based models substantially outperformed the conventional CART, k-NN, and SVM algorithms. The remarkably high AUC values (0.965) obtained for LightGBM and XGBoost underscore their exceptional ability to discriminate between fire and non-fire conditions, achieving a well-balanced trade-off between precision and recall. These findings confirm the superior efficiency of gradient boosting approaches in modeling complex environmental processes, particularly in capturing the nonlinear dependencies among wildfire-driving factors in Mediterranean landscapes.

Figure 10
ROC curve graph comparing five models: CART (AUC = 0.889), k-NN (AUC = 0.840), SVM (AUC = 0.878), LightGBM (AUC = 0.965), and XGBoost (AUC = 0.965). True Positive Rate is plotted against False Positive Rate.

Figure 10. Combined ROC curves illustrating classification performance of all machine learning models.

Figure 11
Confusion matrices for five machine learning models: CART, k-NN, SVM, LightGBM, and XGBoost, displaying predicted vs. actual fire occurrences. Accuracy and AUC scores are shown for each. CART: accuracy 0.891, AUC 0.889. k-NN: accuracy 0.766, AUC 0.840. SVM: accuracy 0.774, AUC 0.878. LightGBM: accuracy 0.905, AUC 0.965. XGBoost: accuracy 0.920, AUC 0.965. Values indicate correctly and incorrectly classified instances.

Figure 11. Comparative confusion matrices for all machine learning models.

Figure 12
Bar chart comparing the accuracy of five machine learning models: CART with 89.1%, k-NN with 76.6%, SVM with 77.4%, LightGBM with 90.5%, and XGBoost with 92.0%. XGBoost shows the highest accuracy.

Figure 12. Accuracy assessment of all machine learning models.

Figure 13
Six bar graphs showing precision, recall, and F1-scores for

Figure 13. Comparative bar chart illustrating accuracy, precision, recall, and F1-score for fire and non-fire points across five machine learning models.

3.5 SHapley additive exPlanations analysis

3.5.1 Global feature importance

The SHAP global feature importance results reveal distinct patterns in wildfire susceptibility drivers between the two ensemble models (Figure 14). Both LightGBM and XGBoost consistently identify NDVI as the most influential variable, with mean SHAP values of approximately 3.0 and 2.0, respectively, indicating its dominant role in shaping wildfire predictions. High NDVI values, representing denser vegetation, consistently increase fire risk by providing abundant combustible biomass, while sparse vegetation (low NDVI) reduces susceptibility.

Figure 14
Bar charts comparing SHAP feature importance for LightGBM and XGBoost models. Both charts show NDVI as the most important feature. For LightGBM, other significant features are Elevation and WindSpeed. In XGBoost, LST, WindSpeed, and Elevation follow NDVI in importance.

Figure 14. Global SHAP bar plot of average feature importance.

However, the models diverge in their secondary predictors. LightGBM prioritizes Elevation (≈2.5 SHAP) and WindSpeed (≈2.2 SHAP) as the next most important factors, suggesting this model emphasizes topographic and meteorological influences. Higher elevations may correlate with specific fuel types and microclimates, while increased wind speeds accelerate fire spread and intensity.

In contrast, XGBoost assigns greater importance to LST (Land Surface Temperature, ≈1.8 SHAP) as the second-most influential variable, followed by WindSpeed (≈1.6 SHAP) and Elevation (≈1.5 SHAP). This indicates XGBoost places stronger emphasis on thermal conditions that promote fuel drying and ignition probability.

Both models show moderate contributions from Distance to Streams and Distance to Settlements, though their relative importance differs between algorithms. Lower-impact variables include Slope, TWI, Aspect, and Distance to Roads, with minimal influence on model predictions.

The divergence in feature rankings between LightGBM and XGBoost highlights the complex, multi-factorial nature of wildfire susceptibility, where different algorithms capture varying aspects of the environmental drivers while converging on vegetation density as the primary determinant.

3.5.2 Beeswarm analysis of feature contributions

The SHAP beeswarm plot for LightGBM reveals distinct patterns in wildfire susceptibility drivers across the study area (Figure 15). NDVI dominates the model with the widest SHAP value spread (−8 to +4), where high vegetation density (red dots) strongly increases fire risk by providing abundant combustible biomass, while sparse vegetation (blue dots) significantly reduces susceptibility. Elevation shows a clear positive relationship, with higher elevations consistently pushing predictions toward higher risk, likely due to specific fuel types and microclimates in elevated terrain. WindSpeed demonstrates a strong directional effect where higher wind values substantially increase fire risk through accelerated spread and intensity, while calmer conditions provide protective effects. LST exhibits mixed but generally positive influence, with warmer conditions (red) tending to increase risk through enhanced fuel drying. Distance to settlements shows an interesting pattern where closer proximity moderately increases risk, suggesting human ignition sources, while more remote areas experience reduced fire probability. Distance to streams displays a protective effect, with closer water proximity reducing risk through moisture buffering, while drier distant areas show elevated susceptibility. Slope, TWI, Aspect, and Distance to roads show minimal but interpretable effects, with steeper slopes and drier conditions (low TWI) slightly increasing risk, while road proximity has negligible influence (Figure 16).

Figure 15
SHAP summary plots for LightGBM and XGBoost models show feature impacts on model output. Each plot displays SHAP values for features such as NDVI, Elevation, and WindSpeed, with colors indicating feature values from low (blue) to high (pink).

Figure 15. SHAP beeswarm plot of feature contributions to fire-risk prediction.

Figure 16
Dot plots compare SHAP interaction values for LightGBM and XGBoost models. Both models include features like NDVI, Elevation, WindSpeed, LST, Dis_Settle, Dis_Stream, and Slope. The plots display the distribution and impact of each feature on model predictions, with values ranging from negative to positive.

Figure 16. SHAP-based interaction analysis for LightGBM and XGBoost models.

The XGBoost SHAP beeswarm plot demonstrates a different feature importance hierarchy with unique interaction patterns. While NDVI remains the most influential predictor, its effect range is narrower (−6 to +2) compared to LightGBM, yet still shows high vegetation density strongly increasing risk and sparse vegetation providing protection. LST emerges as the second-most important variable in XGBoost, with a clear positive relationship where higher temperatures substantially increase fire risk through enhanced fuel desiccation, while cooler conditions reduce susceptibility. WindSpeed maintains a strong positive effect, with high wind values pushing predictions toward increased risk and calm conditions showing protective effects. Elevation displays a more moderate but consistent positive influence compared to LightGBM, with higher elevations associated with increased fire probability. Slope shows a noticeable positive effect in XGBoost, with steeper terrain increasing risk, possibly due to faster upslope fire spread. Distance to streams and settlements exhibit similar protective patterns as in LightGBM, with closer water proximity reducing risk and remote human settlements showing complex relationships. TWI, Aspect, and Distance to roads continue to show minimal influences, though XGBoost captures slightly stronger terrain moisture effects (TWI) than LightGBM, with drier conditions associated with marginally higher risk.

3.5.3 Feature interaction effects

The SHAP interaction plots reveal how environmental factors combine to shape wildfire susceptibility, with notable differences between LightGBM and XGBoost in capturing these complex relationships. Feature interactions are particularly important in fire modeling because fuel, weather, and topographic variables rarely act independently; their synergies often determine fire behavior and spread potential.

3.5.3.1 LightGBM interaction patterns

LightGBM captures distinct synergistic relationships between wildfire drivers, with elevation emerging as a key moderator of other factors (Table 8). The model shows strong positive interactions between NDVI and elevation, indicating that vegetation density has amplified effects in higher terrain, where fuel types and microclimates create heightened fire susceptibility. Wind speed interacts positively with multiple variables, particularly enhancing the effects of temperature and elevation to create extreme fire weather conditions. Interestingly, distance to settlements shows negative interactions with elevation, suggesting human presence modifies the fire risk typically associated with higher terrain. Water-related features (distance to streams) demonstrate buffering effects, particularly reducing wind impacts through moisture-mediated microclimate regulation.

Table 8
www.frontiersin.org

Table 8. Key LightGBM interaction patterns.

3.5.3.2 XGBoost interaction patterns

XGBoost emphasizes thermal-driven interactions, with land surface temperature acting as a universal amplifier across multiple feature combinations (Table 9). The model shows the strongest positive interaction between LST and wind speed, creating maximum fire risk scenarios under hot, windy conditions. Unlike LightGBM, XGBoost reveals strong positive synergy between NDVI and LST, indicating that temperature enhances rather than diminishes vegetation-related fire risk, possibly through accelerated fuel curing. Elevation maintains consistent positive interactions but with reduced magnitude compared to LightGBM, while slope shows enhanced importance through its interactions with wind and elevation, suggesting topographic complexity modifies fire spread patterns.

Table 9
www.frontiersin.org

Table 9. Key XGBoost interaction patterns.

The contrasting interaction patterns between models highlight different aspects of wildfire ecology: LightGBM emphasizes elevation-mediated relationships, while XGBoost focuses on thermal amplification and vegetation-weather synergies, providing complementary perspectives on fire susceptibility drivers.

4 Discussion

This study demonstrates that the combined use of machine learning and SHAP explainability delivers both robust predictive accuracy and ecological interpretability for wildfire susceptibility mapping in Mediterranean landscapes. Our multi-model approach revealed that ensemble methods, particularly LightGBM and XGBoost, achieved superior performance compared to traditional algorithms, with LightGBM attaining the highest predictive accuracy (Accuracy = 0.93, AUC = 0.98) followed closely by XGBoost (Accuracy = 0.91, AUC = 0.97). These findings align with broader regional observations, such as in Türkiye’s Antalya region, where ensemble methods similarly excel in susceptibility mapping using comprehensive sets of environmental, climatic, and anthropogenic factors (Bilucan et al., 2024).

The exceptional performance of our models must be understood within the broader context of unprecedented regional warming. Our study period coincided with the exceptional summer of 2022 documented across the Mediterranean basin, where Serrano-Notivoli et al. (2023) revealed that Spain experienced its warmest summer in 700 years, with record-breaking temperatures affecting nearly half the country. Similar synoptic patterns influenced Northern Morocco during our study period, explaining the prominence of thermal factors in our models.

The use of SHAP transforms “black box” models into transparent tools by revealing consistent and interpretable drivers across algorithms, addressing a major limitation in prior wildfire modeling studies. This aligns with Cilli et al. (2022), who developed an XAI framework across the Italian peninsula, identifying pivotal variables like NDVI and emphasizing the utility of SHAP in decision-making contexts. Similarly, Iban and Aksu (2024) applied SHAP-based interpretation to machine learning models in Turkey’s İzmir region, finding that SHAP effectively revealed critical drivers including NDVI, wind speed, temperature, slope, and settlement proximity.

Our SHAP analysis consistently identified NDVI as the dominant predictor across both models, with mean SHAP values of 3.0 and 2.0 for LightGBM and XGBoost, respectively, highlighting vegetation density as the primary indicator of combustible fuel load. The significance of wind speed in our models must be understood within the context of regional atmospheric patterns, particularly the Chergui wind—a hot, dry easterly wind that characterizes Moroccan summers and interacts synergistically with elevated temperatures to create dangerous fire weather conditions (Seddouki et al., 2023). Land Surface Temperature (LST) emerged as another critical factor, reflecting the regional thermal amplification that accelerates fuel desiccation and extends high flammability periods. Ensemble modeling studies in Türkiye further confirm that these factors collectively shape fire susceptibility (Purnama et al., 2024).

SHAP interaction analysis further exposed synergistic combinations, such as elevation interacting with wind speed or LST, combining with vegetation density to elevate risk zones. These nuanced insights provide actionable intelligence, suggesting that susceptibility mapping should prioritize specific factor interactions, not just risk hotspots. The strong positive interaction between elevated temperatures and wind speed during Chergui events demonstrates how compound climate extremes create disproportionately high fire risk. Such an approach is crucial for targeting early-warning systems, deployment of firefighting resources, and community-level risk mitigation during periods of concurrent high temperatures and strong winds.

While the results demonstrate strong predictive capacity, several limitations should be acknowledged. First, the availability of ground-based climate data—including wind speed, temperature, and humidity, remain limited in Morocco. Most local meteorological stations are not easily accessible to researchers, forcing reliance on reanalysis products (e.g., ERA5) or remote sensing proxies. Although these sources provide useful spatial coverage, they may not capture the fine-scale climatic variability that directly shapes fire ignition and spread. Second, accessing official fire records remains a challenge: obtaining the precise geographic coordinates of fire events involves lengthy and complex procedures, which limit the timeliness of fire datasets. This dependency on external institutional processes highlights the need for more open-access fire databases in Morocco.

Additional limitations persist in our methodological framework. Our modeling is based on a single year (2022) of wildfire data, a constraint that may limit temporal applicability despite capturing an exceptionally hot period that may preview future climate conditions. The study could be made more effective if real-time or high-resolution in situ climate data were integrated with remote sensing indicators. The combination of localized meteorological observations and satellite-based indices would improve both the accuracy of machine learning models and the interpretability of SHAP analysis, thereby yielding more reliable susceptibility maps.

Future work should incorporate multi-year datasets to account for interannual variability in fire-weather interactions and distinguish between transient extremes and persistent trends. Additionally, incorporating climate projections and socio-economic risk scenarios would improve forward-looking risk forecasting.

Overall, this study contributes to a growing body of evidence that interpretable machine learning, especially when paired with explainability frameworks like SHAP, offers a powerful and actionable path for Mediterranean wildfire research. By demonstrating the superior performance of ensemble methods while providing transparent insights into their decision-making processes, we bridge the gap between predictive analytics and field-level fire management. The integration of explainable AI with robust environmental modeling provides a framework that can adapt to evolving climate conditions while maintaining ecological interpretability—a crucial combination for effective wildfire management in Morocco’s most vulnerable ecosystems under changing climatic conditions.

5 Conclusion

This study demonstrates that integrating machine learning with SHAP-based explainability offers a robust framework for wildfire susceptibility mapping in Mediterranean regions under changing climatic conditions. By combining environmental, climatic, topographic, and anthropogenic factors with recent fire records, the models achieved high predictive accuracy and ecological interpretability. Among all algorithms, XGBoost performed best, achieving an accuracy of 0.920 and F1-scores of 0.926 (fire) and 0.912 (non-fire), followed by LightGBM (508–511 in conclusion, AUC = 0.965), confirming the superior capability of gradient boosting approaches over conventional classifiers such as CART, k-NN, and SVM in complex wildfire risk modeling.

SHAP analysis identified NDVI as the most influential predictor (mean SHAP = 3.0 for LightGBM and 2.0 for XGBoost), underscoring vegetation density as a key determinant of fuel availability. Secondary drivers varied between models—LightGBM prioritized elevation and wind speed, whereas XGBoost emphasized land surface temperature and wind speed—reflecting model-specific sensitivities to environmental controls. Interaction effects revealed critical synergies, particularly between elevated temperatures and strong winds during Chergui events, and between vegetation density and topographic position, emphasizing how compound extremes amplify fire risk. The results confirm that wildfires in Mediterranean ecosystems arise from non-linear interactions among fuel, topography, and meteorological factors, intensified by ongoing regional warming, as observed during the extreme summer of 2022. The high-resolution susceptibility maps produced for the TTA region provide operational insights for agencies such as ANEF, supporting targeted prevention, optimized resource allocation, and climate-informed early warning during concurrent heat and wind extremes.

Despite these advances, limitations persist, including restricted access to ground-based meteorological data, dependence on institutional fire records, and reliance on a single-year dataset. Expanding to multi-year records, improving access to in-situ climate observations, and incorporating climate projections would enhance model robustness and temporal transferability. Future work should integrate climate and socio-economic scenarios to support adaptive wildfire management under evolving Mediterranean climate conditions. Overall, this research establishes an explainable AI–based environmental modeling framework that links predictive analytics with practical fire management, offering a scalable and scientifically grounded approach for safeguarding Morocco’s forest ecosystems amid accelerating climate change.

Data availability statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author contributions

AM: Conceptualization, Formal analysis, Investigation, Methodology, Supervision, Validation, Visualization, Writing – original draft, Writing – review & editing, Project administration, Software. AK: Conceptualization, Data curation, Formal analysis, Methodology, Validation, Visualization, Writing – original draft, Software, Supervision. AE: Formal analysis, Investigation, Methodology, Validation, Writing – original draft, Writing – review & editing, Resources. WA: Formal analysis, Funding acquisition, Resources, Writing – review & editing, Project administration. NR: Formal analysis, Resources, Visualization, Writing – review & editing. YY: Formal analysis, Visualization, Writing – review & editing, Funding acquisition, Methodology, Project administration, Supervision.

Funding

The author(s) declare that financial support was received for the research and/or publication of this article. This research was funded by the Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2025R680), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.

Acknowledgments

The authors extend their appreciation to the Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2025R680), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. The authors also gratefully acknowledge USGS–NASA for providing access to Landsat and DEM datasets via the Google Earth Engine platform.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Correction note

This article has been corrected with minor changes. These changes do not impact the scientific content of the article.

Generative AI statement

The authors declare that no Gen AI was used in the creation of this manuscript.

Any alternative text (alt text) provided alongside figures in this article has been generated by Frontiers with the support of artificial intelligence and reasonable efforts have been made to ensure accuracy, including review by the authors wherever possible. If you identify any issues, please contact us.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

AUC, Area under the Curve (ROC curve); CART, Classification and Regression Trees; Dis_Road, Distance to roads; Dis_Settle, Distance to settlements; Dis_Stream, Distance to streams; Elevation, Elevation above sea level; k-NN, k-Nearest Neighbors; LightGBM, Light Gradient Boosting Machine; LST, Land Surface Temperature; NDVI, Normalized Difference Vegetation Index; SHAP, SHapley Additive exPlanations; Slope, Slope gradient/steepness; SVM, Support Vector Machine; TTA, Tangier–Tétouan–Al Hoceima region; TWI, Topographic Wetness Index; WindSpeed, Wind speed; XAI, Explainable Artificial Intelligence; XGBoost, Extreme Gradient Boosting.

References

Abdollahi, A., and Pradhan, B. (2023). Explainable artificial intelligence (XAI) for interpreting the contributing factors feed into the wildfire susceptibility prediction model. Sci. Total Environ. 879:163004. doi: 10.1016/j.scitotenv.2023.163004

PubMed Abstract | Crossref Full Text | Google Scholar

Abujayyab, S. K. M., Kassem, M. M., Khan, A. A., Wazirali, R., Coşkun, M., Taşoğlu, E., et al. (2022). Wildfire susceptibility mapping using five boosting machine learning algorithms: the case study of the Mediterranean region of Turkey. Adv. Civil Eng. 2022:3959150. doi: 10.1155/2022/3959150

Crossref Full Text | Google Scholar

Ahmad, H., Wu, Z., Huang, H., Muhammad, S., Hayat, M., Abbas, K., et al. (2025). A comparative evaluation of forest fire hazard vulnerability through geographic information system-based techniques. Front. For. Glob. Change 8:1635041. doi: 10.3389/ffgc.2025.1635041

Crossref Full Text | Google Scholar

Badda, H., Cherif, E. K., Boulaassal, H., Wahbi, M., Yazidi Alaoui, O., Maatouk, M., et al. (2023). Improving the accuracy of random forest classifier for identifying burned areas in the Tangier-Tetouan-Al Hoceima region using Google earth engine. Remote Sens 15:4226. doi: 10.3390/rs15174226

Crossref Full Text | Google Scholar

Bar, S., Parida, B. R., Pandey, A. C., Shankar, B. U., Kumar, P., Panda, S. K., et al. (2023). Modeling and prediction of fire occurrences along an elevational gradient in Western Himalayas. Appl. Geogr. 151:102867. doi: 10.1016/j.apgeog.2022.102867

Crossref Full Text | Google Scholar

Bilucan, F., Teke, A., and Kavzoglu, T. (2024). “Susceptibility mapping of wildfires using XGBoost, random Forest and AdaBoost: a case study of Mediterranean ecosystem” in Recent research on geotechnical engineering, remote sensing, geophysics and earthquake seismology. eds. M. Bezzeghoud, Z. A. Ergüler, J. Rodrigo-Comino, M. K. Jat, R. Kalatehjari, and D. S. Bisht (Cham, Switzerland: Springer Nature), 99–101.

Google Scholar

Boubekraoui, H., Maouni, Y., Ghallab, A., Draoui, M., and Maouni, A. (2023). Wildfires risk assessment using hotspot analysis and results application to wildfires strategic response in the region of Tangier-Tetouan-Al Hoceima, Morocco. Fire 6:314. doi: 10.3390/fire6080314

Crossref Full Text | Google Scholar

Chen, X., Zhang, Y., Wang, S., Zhao, Z., Liu, C., and Wen, J. (2024). Comparative study of machine learning methods for mapping forest fire areas using sentinel-1B and 2A imagery. Front. Remote Sens. 5:641. doi: 10.3389/frsen.2024.1446641

Crossref Full Text | Google Scholar

Cilli, R., Elia, M., D’Este, M., Giannico, V., Amoroso, N., Lombardi, A., et al. (2022). Explainable artificial intelligence (XAI) detects wildfire occurrence in the Mediterranean countries of southern Europe. Sci. Rep. 12:16349. doi: 10.1038/s41598-022-20347-9

PubMed Abstract | Crossref Full Text | Google Scholar

Couto, F. T., Santos, F. L. M., Campos, C., Andrade, N., Purificação, C., and Salgado, R. (2022). Is Portugal starting to burn all year long? The transboundary fire in January 2022. Atmos. 13:1677. doi: 10.3390/atmos13101677

Crossref Full Text | Google Scholar

Davis, F. W., Parkinson, A.-M., Moritz, M. A., Park, I. W., and D’Antonio, C. M. (2025). Increasing vulnerability of an endemic Mediterranean-climate conifer to changing climate and fire regime. Front. For. Glob. Change 8:1516623. doi: 10.3389/ffgc.2025.1516623

Crossref Full Text | Google Scholar

Ejaz, N., and Choudhury, S. (2025). A comprehensive survey of the machine learning pipeline for wildfire risk prediction and assessment. Ecol. Inform. 90:103325. doi: 10.1016/j.ecoinf.2025.103325

Crossref Full Text | Google Scholar

Estes, B. L., Knapp, E. E., Skinner, C. N., Miller, J. D., and Preisler, H. K. (2017). Factors influencing fire severity under moderate burning conditions in the Klamath Mountains, northern California, USA. Ecosphere 8:e01794. doi: 10.1002/ecs2.1794

Crossref Full Text | Google Scholar

Gentilucci, M., Barbieri, M., Younes, H., Rihab, H., and Pambianchi, G. (2024). Analysis of wildfire susceptibility by weight of evidence, using geomorphological and environmental factors in the Marche region, Central Italy. Geosciences 14:112. doi: 10.3390/geosciences14050112

Crossref Full Text | Google Scholar

Gentilucci, M., Younes, H., Hadji, R., Casagli, N., and Pambianchi, G. (2025). Influence of land surface temperatures, precipitation, total water storage anomaly and fraction of absorbed photosynthetically active radiation anomaly, obtained from MODIS, IMERG and GRACE satellite products on wildfires in eastern Central Italy. Int. J. Remote Sens. 46, 5465–5499. doi: 10.1080/01431161.2025.2522941

Crossref Full Text | Google Scholar

HCP. (2014). Recensement général de la population et de l’habitat. Haut-Commissariat au Plan du Maroc. Available online at: http://rgphentableaux.hcp.ma/Default1/ (Accessed April 16, 2020).

Google Scholar

Iban, M. C., and Aksu, O. (2024). SHAP-driven explainable artificial intelligence framework for wildfire susceptibility mapping using MODIS active fire pixels: an in-depth interpretation of contributing factors in Izmir, Türkiye. Remote Sens 16:2842. doi: 10.3390/rs16152842

Crossref Full Text | Google Scholar

Islam, T., Hulley, G. C., Malakar, N. K., Radocinski, R. G., Guillevic, P. C., and Hook, S. J. (2017). A physics-based algorithm for the simultaneous retrieval of land surface temperature and emissivity from VIIRS thermal infrared data. IEEE Trans. Geosci. Remote Sens. 55, 563–576. doi: 10.1109/TGRS.2016.2611566

Crossref Full Text | Google Scholar

L’Economiste. (2023). Incendies de forêts au Maroc: plus de 22.700 ha touchés en 2022. L’Economiste. Available online at: https://www.leconomiste.com/flash-infos/incendies-de-forets-au-maroc-plus-de-22-700-ha-touches-en-2022/ (Accessed October 19, 2025).

Google Scholar

Lee, C., Choi, E. H., Han, Y., and Lee, Y. (2025). Year-round daily wildfire prediction and key factor analysis using machine learning: a case study of Gangwon state, South Korea. Sci. Rep. 15:29910. doi: 10.1038/s41598-025-15508-5

PubMed Abstract | Crossref Full Text | Google Scholar

Liao, B., Zhou, T., Liu, Y., Li, M., and Zhang, T. (2025). Tackling the wildfire prediction challenge: an explainable artificial intelligence (XAI) model combining extreme gradient boosting (XGBoost) with SHapley additive exPlanations (SHAP) for enhanced interpretability and accuracy. Forests 16:689. doi: 10.3390/f16040689

Crossref Full Text | Google Scholar

Michael, Y., Helman, D., Glickman, O., Gabay, D., Brenner, S., and Lensky, I. M. (2021). Forecasting fire risk with machine learning and dynamic information derived from satellite vegetation index time-series. Sci. Total Environ. 764:142844. doi: 10.1016/j.scitotenv.2020.142844

PubMed Abstract | Crossref Full Text | Google Scholar

Molina, J. R., Lora, A., Prades, C., and Rodríguez Y Silva, F. (2019). Roadside vegetation planning and conservation: new approach to prevent and mitigate wildfires based on fire ignition potential. For. Ecol. Manag. 444, 163–173. doi: 10.1016/j.foreco.2019.04.034

Crossref Full Text | Google Scholar

Moumane, A., Elmotawakkil, A., Hasan, M. M., Kranjčić, N., Batchi, M., Karkouri, J. A., et al. (2025). Integrating GIS, remote sensing, and machine learning to optimize sustainable groundwater recharge in arid Mediterranean landscapes: a case study from the middle Draa Valley, Morocco. Water 17:2336. doi: 10.3390/w17152336

Crossref Full Text | Google Scholar

Niranjan, A. (2025). EU wildfires worst on record as burning season continues. The Guardian. Available online at: https://www.theguardian.com/world/2025/aug/22/eu-wildfires-worst-year-on-record-as-season-continues (Accessed August 23, 2025).

Google Scholar

Nur, A. S., Kim, Y. J., and Lee, C.-W. (2022). Creation of wildfire susceptibility maps in Plumas National Forest using InSAR coherence, deep learning, and metaheuristic optimization approaches. Remote Sens 14:4416. doi: 10.3390/rs14174416

Crossref Full Text | Google Scholar

Pacheco, A. d. P., Junior, J. A. d. S., Ruiz-Armenteros, A. M., and Henriques, R. F. F. (2021). Assessment of k-nearest neighbor and random forest classifiers for mapping forest fire areas in Central Portugal using Landsat-8, Sentinel-2, and Terra imagery. Remote Sens 13:1345. doi: 10.3390/rs13071345

Crossref Full Text | Google Scholar

Parajuli, A., Gautam, A. P., Sharma, S. P., Bhujel, K. B., Sharma, G., Thapa, P. B., et al. (2020). Forest fire risk mapping using GIS and remote sensing in two major landscapes of Nepal. Geomatics Nat. Hazards Risk 11, 2569–2586. doi: 10.1080/19475705.2020.1853251

Crossref Full Text | Google Scholar

Purnama, M. I., Jaya, I. N. S., Syaufina, L., Çoban, H. O., and Raihan, M. (2024). Predicting forest fire vulnerability using machine learning approaches in the Mediterranean region: a case study of Türkiye. IOP Conf. Ser.: Earth Environ. Sci. 1315:012056. doi: 10.1088/1755-1315/1315/1/012056

Crossref Full Text | Google Scholar

RGPH. (2024). Recensement général de la population et de l’habitat 2014. Available online at: https://www.recensement.ma/ (Accessed December 4, 2023).

Google Scholar

Rodrigues, M., Cunill Camprubí, À., Balaguer-Romano, R., Coco Megía, C. J., Castañares, F., Ruffault, J., et al. (2023). Drivers and implications of the extreme 2022 wildfire season in Southwest Europe. Sci. Total Environ. 859:160320. doi: 10.1016/j.scitotenv.2022.160320

PubMed Abstract | Crossref Full Text | Google Scholar

Rogeau, M.-P., and Armstrong, G. W. (2017). Quantifying the effect of elevation and aspect on fire return intervals in the Canadian Rocky Mountains. For. Ecol. Manag. 384, 248–261. doi: 10.1016/j.foreco.2016.10.035

Crossref Full Text | Google Scholar

Salis, M., Del Giudice, L., Alcasena-Urdiroz, F., Jahdi, R., Arca, B., Pellizzaro, G., et al. (2023). Assessing cross-boundary wildfire hazard, transmission, and exposure to communities in the Italy-France maritime cooperation area. Front. For. Glob. Change 6:1241378. doi: 10.3389/ffgc.2023.1241378

Crossref Full Text | Google Scholar

Seddouki, M., Benayad, M., Aamir, Z., Tahiri, M., Maanan, M., and Rhinane, H. (2023). Using machine learning coupled with remote sensing for forest fire susceptibility mapping. Case study Tetouan province, Northern Morocco. Int. Arch. Photogramm. Remote Sens. Spatial Inf. Sci. 4, 333–342. doi: 10.5194/isprs-archives-XLVIII-4-W6-2022-333-2023

Crossref Full Text | Google Scholar

Serrano-Notivoli, R., Tejedor, E., Sarricolea, P., Meseguer-Ruiz, O., De Luis, M., Saz, M. Á., et al. (2023). Unprecedented warmth: a look at Spain’s exceptional summer of 2022. Atmos. Res. 293:106931. doi: 10.1016/j.atmosres.2023.106931

Crossref Full Text | Google Scholar

Sheriff, R., Meer, M. S., Aslam, R. W., and Said, Y. (2025). Machine learning-based forest fire susceptibility mapping using random forest and CART models. Rangeland Ecol. Manag. 102, 96–109. doi: 10.1016/j.rama.2025.06.004

Crossref Full Text | Google Scholar

Singha, C., Swain, K. C., Moghimi, A., Foroughnia, F., and Swain, S. K. (2024). Integrating geospatial, remote sensing, and machine learning for climate-induced forest fire susceptibility mapping in Similipal Tiger Reserve, India. For. Ecol. Manag. 555:121729. doi: 10.1016/j.foreco.2024.121729

Crossref Full Text | Google Scholar

Sultan, Y. E. D., Pillai, K. R. A., and Sharma, A. (2025). Wildfire indicators modeling for reserved forest of Vellore district (Tamil Nadu, India). Front. Remote Sens. 6:1518539. doi: 10.3389/frsen.2025.1518539

Crossref Full Text | Google Scholar

Symeonidis, P., Vafeiadis, T., Ioannidis, D., and Tzovaras, D. (2025). Wildfire susceptibility mapping in Greece using ensemble machine learning. Earth 6:75. doi: 10.3390/earth6030075

Crossref Full Text | Google Scholar

Toutate, I. (2025). Firefighters Control Three of Four Main Blazes in Morocco’s Chefchaouen Forest. Morocco World News. Available online at: https://www.moroccoworldnews.com/2025/08/237552/firefighters-control-three-of-four-main-blazes-in-moroccos-chefchaouen-forest/ (Accessed October 19, 2025).

Google Scholar

Vallet, L., Schwartz, M., Ciais, P., Van Wees, D., De Truchis, A., and Mouillot, F. (2023). High-resolution data reveal a surge of biomass loss from temperate and Atlantic pine forests, contextualizing the 2022 fire season distinctiveness in France. Biogeosciences 20, 3803–3825. doi: 10.5194/bg-20-3803-2023

Crossref Full Text | Google Scholar

Yue, W., Ren, C., Liang, Y., Liang, J., Lin, X., Yin, A., et al. (2023). Assessment of wildfire susceptibility and wildfire threats to ecological environment and urban development based on GIS and multi-source data: a case study of Guilin, China. Remote Sens 15:2659. doi: 10.3390/rs15102659

Crossref Full Text | Google Scholar

Zhao, E., Wang, N., Cui, S., Zhao, R., and Yu, Y. (2025). A new weighted rough set and improved BP neural network method for predicting forest fires. Reliab. Eng. Syst. Saf. 263:111206. doi: 10.1016/j.ress.2025.111206

Crossref Full Text | Google Scholar

Keywords: machine learning, SHAP, wildfire, climatic extremes, Mediterranean region

Citation: Moumane A, Al Karkouri A, Elmotawakkil A, Alkhuraiji WS, Rebouh NY and Youssef YM (2025) Advancing wildfire susceptibility mapping through maching learning and SHapley Additive exPlanations-integrated geospatial analysis in Northern Morocco’s Mediterranean region. Front. For. Glob. Change. 8:1705341. doi: 10.3389/ffgc.2025.1705341

Received: 14 September 2025; Revised: 30 October 2025; Accepted: 17 November 2025;
Published: 01 December 2025;
Corrected: 03 December 2025.

Edited by:

Himanshu Bargali, Gurukul Kangri University, India

Reviewed by:

Liao Bin, Guizhou University of Finance and Economics, China
Oktay Aksu, Istanbul Okan Universitesi, Türkiye

Copyright © 2025 Moumane, Al Karkouri, Elmotawakkil, Alkhuraiji, Rebouh and Youssef. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Adil Moumane, YWRpbC5tb3VtYW5lQHVpdC5hYy5tYQ==; Youssef M. Youssef, eW91c3NlZi5pYnJhaGltQHBtZS5zdWV6dW5pLmVkdS5lZw==

ORCID: Adil Moumane, orcid.org/0000-0003-0296-2679
Adnane Al Karkouri, orcid.org/0009-0000-5957-658X

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.