ORIGINAL RESEARCH article

Front. Mar. Sci.

Sec. Ocean Solutions

Volume 12 - 2025 | doi: 10.3389/fmars.2025.1631650

This article is part of the Research TopicBig Data and AI for Sustainable Maritime OperationsView all articles

Multi-source data-driven Bayesian network for risk analysis of maritime accidents in the high sea

Provisionally accepted
Xiaotong  QuXiaotong Qu1Chengbo  WangChengbo Wang2*Ruijia  ZhaoRuijia Zhao1Mingxing  FangMingxing Fang3Xinlian  XieXinlian Xie1
  • 1College of Transportation Engineering, Dalian Maritime University, Dalian, China
  • 2University of Science and Technology of China, Hefei, Anhui Province, China
  • 3Anhui Normal University, Wuhu, Anhui Province, China

The final, formatted version of the article will be published soon.

Incomplete data significantly hampers risk analysis for high-sea maritime accidents (HSMAs). This paper introduces a novel multi-source data-driven Bayesian network (DDBN) framework to address this limitation. This framework initially integrates heterogeneous data from multiple sources, including accident reports, ship characteristics, environmental conditions, etc. The structural learning of the DDBN employs a hybrid, tri-source enhanced methodology. Informed by a two-stage risk evolution theory, this approach integrates evidence from structured data, text analysis, and expert knowledge to construct a unified network structure, ensuring that derived causal relationships align with both statistical evidence and domain expertise. Subsequently, the Expectation-Maximization algorithm is employed for parameter estimation to handle missing data. The findings indicate that although the accident type, sea area and ship type all contribute to the risk level of HSMAs, the gross tonnage is the most critical factor that directly affects the likelihood of an accident. The DDBN model achieves an accident prediction accuracy of 84.0%, with both precision and recall exceeding 75%. Furthermore, DDBN-based scenario analysis proactively identifies high-risk scenarios associated with specific accident types and gross tonnage, offering maritime authorities and operators an enhanced tool for risk assessment. This study provides a scientific basis for formulating targeted HSMA prevention strategies.

Keywords: Maritime safety, Maritime accidents, data-driven Bayesian network, risk analysis, High Sea, incomplete data

Received: 20 May 2025; Accepted: 10 Jun 2025.

Copyright: © 2025 Qu, Wang, Zhao, Fang and Xie. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Chengbo Wang, University of Science and Technology of China, Hefei, 230026, Anhui Province, China

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.