Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Artif. Intell.

Sec. Machine Learning and Artificial Intelligence

Volume 8 - 2025 | doi: 10.3389/frai.2025.1663891

This article is part of the Research TopicAI-Driven Hybrid Group Intelligence Decision-MakingView all articles

A Chinese Question and Answer System for Liver Cancer based on Knowledge Graph and Large Language Mode

Provisionally accepted
Haoqi  WuHaoqi Wu1Min  ZhangMin Zhang2Hailing  WangHailing Wang1*Xiaoyan  JiangXiaoyan Jiang1Yongbin  GaoYongbin Gao1Rong  HuangRong Huang1Zhijun  FangZhijun Fang1Xiaojun  HuXiaojun Hu3Yingfang  FanYingfang Fan3*
  • 1Shanghai University of Engineering Sciences, Shanghai, China
  • 2Ninth Hospital of Xi'an Department of Endocrinology, Xi'an, China
  • 3The Third Affiliated Hospital of Southern Medical University, Guangzhou, China

The final, formatted version of the article will be published soon.

Introduction: The liver cancer question-and-answer (Q\&A) system is primarily intended to help patients access disease-related information more conveniently. However, there is currently no Q\&A system specifically developed for liver cancer. Additionally, most existing Q\&A systems lack real clinical data and have limited capability in understanding Chinese questions. Methods: This paper proposes a Chinese liver cancer question-answering system based on knowledge graphs and Large Language Models (LLMs). To unify information from diverse sources, the system employs a knowledge graph to store entities and inter-entity relationships extracted from patients' clinical electronic medical records and the professional medical website xywy.com, which serves as the foundation for the system's responses. Specifically, ChatGLM3.5 is utilized to extract entity information from questions, while BERT is applied to understand users' intent. Subsequently, the system retrieves corresponding information from the knowledge graph. Finally, the retrieved information is integrated, and a natural language response is generated as the answer to the question. Results:The experimental results indicate that in terms of intent classification, our system achieves a precision of 92.34\%, representing an improvement of 1.38\% over the BERT model and 4.32\% over the GEBERT model. In terms of response relevance, the system's outputs are more aligned with patients' daily speech patterns and exhibit higher relevance to the target questions.

Keywords: Large Language Model, Question and answer system, liver cancer, knowledge graph, data integration

Received: 11 Jul 2025; Accepted: 19 Sep 2025.

Copyright: © 2025 Wu, Zhang, Wang, Jiang, Gao, Huang, Fang, Hu and Fan. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence:
Hailing Wang, wanghailing@sues.edu.cn
Yingfang Fan, fanyf068700@sina.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.