ORIGINAL RESEARCH article

Front. Built Environ.

Sec. Geotechnical Engineering

Volume 11 - 2025 | doi: 10.3389/fbuil.2025.1620796

This article is part of the Research TopicEmerging Artificial Intelligence tools in Geotechnical Engineering AdvancementsView all 5 articles

Estimating Shield Tunnel Boring Machine Penetration Rate in Mixed Face Conditions: Feature Selection and Multicollinearity Effects on Machine and Deep Learning Models

Provisionally accepted
Jitendra  KhattiJitendra Khatti1*Swapnil  MishraSwapnil Mishra2
  • 1Jodhpur Institute of Engineering and Technology, Jodhpur, India
  • 2Indian Institute of Technology (Indian School of Mines) Dhanbad, Dhanbad, India

The final, formatted version of the article will be published soon.

This research compares the support vector machine (SVM), gene expression programming (GEP), feedforward neural network (FFNN), gated recurrent unit (GRU), long short-term memory (LSTM), support vector regressor (SVR), and bidirectional long short-term memory (BiLSTM) models in predicting penetration (PR) rate of earth pressure balance shield tunnel boring machine (ETBM). A dataset has been compiled using the cutterhead rotation speed (CRS), mean thrust (F/A), mean cutterhead torque (T/D3), upper earth pressure (UEP), lower earth pressure (LEP), and torque penetration index (TPI) features of 1197 ETBM events. The feature of multicollinearity was analyzed using the variance inflation factor (VIF) method. It was observed that CRS, F/A, T/D3, UEP, LEP, and TPI have weak, moderate, considerable, moderate, problematic, and considerable multicollinearity, respectively. The performance (R) comparison revealed that the BiLSTM models predicted PR (= 1.0000 in testing and validation) with higher performance than SVM, SVR, GEP, FFNN, GRU, and LSTM models. In addition, the score analysis (= 285), error characteristics curve (= 7.03E-07), generalizability (m & n < 0.00), Wilcoxon test (confidence = 95.02%), uncertainty analysis (first rank), Anderson-Darling test (accept the normality hypothesis), and objective function criterion (= 0.0003) presented that the BiLSTM model is an optimal performance computational model in predicting PR of ETBM. It was also noted that the CRS, F/A, T/D3, UEP, LEP, and TPI features are more reliable for accurately predicting PR.

Keywords: Bidirectional Long Short-Term Memory, multicollinearity, Penetration rate, Tunnel boring machine, Feature Selection

Received: 30 Apr 2025; Accepted: 15 Jul 2025.

Copyright: © 2025 Khatti and Mishra. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Jitendra Khatti, Jodhpur Institute of Engineering and Technology, Jodhpur, India

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.