Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Big Data

Sec. Machine Learning and Artificial Intelligence

Volume 8 - 2025 | doi: 10.3389/fdata.2025.1676477

This article is part of the Research TopicFrontiers in Information Technology, Electronics, and Management InnovationView all 4 articles

LLM-supported collaborative ontology design for data and knowledge management platforms

Provisionally accepted
  • 1Riga Technical University, Riga, Latvia
  • 2Max-Planck-Institut fur Nachhaltige Materialien GmbH, Düsseldorf, Germany
  • 3Teknologian tutkimuskeskus VTT Oy, Espoo, Finland

The final, formatted version of the article will be published soon.

The management of vast, heterogeneous, and multidisciplinary data presents a critical challenge across scientific domains, hindering interoperability and slowing scientific progress. This paper addresses this challenge by presenting a pragmatic extension to the NeOn iterative ontology engineering framework, a well-established methodology for collaborative ontology design, which integrates Large Language Models (LLMs) to accelerate key tasks while retaining domain expert-in-the-loop validation. The methodology was applied within the HyWay project, an EU-funded research initiative on hydrogen–materials interactions, to develop the Hydrogen-Material Interaction Ontology (HMIO), a domain-specific ontology covering 29 experimental methods and 14 simulation types for assessing interactions between hydrogen and advanced metallic materials. A key result is the successful integration of the HMIO into a Data and Knowledge Management Platform (DKMP), where it drives the automated generation of data entry forms, ensuring that all captured data is FAIR (Findable, Accessible, Interoperable, and Reusable) and HMIO compliant by design. The validation of this approach demonstrates that this hybrid human-machine workflow for ontology engineering and further integration with the DKMP is an effective and efficient strategy for creating and operationalising complex scientific ontologies, thereby providing a scalable solution to advance data-driven research in materials science and other complex scientific domains.

Keywords: Ontology design, Large language models, FAIR, experiment, simulation, Hydrogen, Metals

Received: 31 Jul 2025; Accepted: 20 Oct 2025.

Copyright: © 2025 Kampars, Mosans, Jogi, Roters and Vajragupta. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Janis Kampars, janis.kampars@rtu.lv

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.