Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Environ. Sci.

Sec. Environmental Informatics and Remote Sensing

Volume 13 - 2025 | doi: 10.3389/fenvs.2025.1633046

Accelerated Bayesian Optimization for CNN+LSTM Learning Rate Tuning via Precomputed Gaussian Process Subspaces in Soil Analysis

Provisionally accepted
Xiaolong  ChenXiaolong ChenHongfeng  ZhangHongfeng Zhang*Cora  UnCora Un*zhengchun  songzhengchun song
  • Macao Polytechnic University, Macau, Macao, SAR China

The final, formatted version of the article will be published soon.

We propose an accelerated Bayesian optimization framework for tuning the learning rate of CNN+LSTM models in soil analysis, addressing the computational inefficiency of traditional Gaussian Process (GP)-based methods. The key innovation lies in a subspace-accelerated GP surrogate model that precomputes low-rank approximations of covariance matrices offline, thereby decoupling the costly hyperparameter tuning from the online acquisition function evaluations. By projecting the hyperparameter search space onto a dominant subspace derived from Nyström approximations, our method reduces the computational complexity from cubic to linear in the number of observations, enabling real-time optimization without sacrificing probabilistic rigor. The proposed system integrates seamlessly with existing CNN+LSTM pipelines, where the offline phase constructs the GP subspace using historical or synthetic data, while the online phase iteratively updates the subspace with rank-1 modifications. Empirical validation on soil spectral datasets demonstrates a 3–5× speedup in convergence compared to standard Bayesian optimization, with no loss in model accuracy. Experiments on soil spectral datasets show convergence in 23.4 minutes (3.8× faster than standard Bayesian optimization) with a test RMSE of 0.142, while maintaining equivalent accuracy across diverse CNN+LSTM architectures. Moreover, the method’s adaptability to non-stationary response surfaces, facilitated by a Matérn-5/2 kernel with automatic relevance determination, makes it particularly suitable for soil data exhibiting multi-scale features. The reformulated approach not only overcomes the scalability limitations of conventional GP-based optimization but also preserves its theoretical guarantees, offering a practical solution for hyperparameter tuning in resource-constrained environments. This work bridges the gap between computational efficiency and probabilistic robustness, with broader implications for automated machine learning in geoscientific applications.

Keywords: Bayesian optimization, CNN+LSTM, soil analysis, Gaussian process, Computational efficiency, hyperparameter tuning

Received: 23 May 2025; Accepted: 22 Jul 2025.

Copyright: © 2025 Chen, Zhang, Un and song. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Hongfeng Zhang, Macao Polytechnic University, Macau, Macao, SAR China
Cora Un, Macao Polytechnic University, Macau, Macao, SAR China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.