Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Artif. Intell.

Sec. Machine Learning and Artificial Intelligence

Medical Pattern Classification Using a Novel Binary Similarity Approach based on an Associative Classifier

Provisionally accepted
Osvaldo  Velázquez-GonzálezOsvaldo Velázquez-González1Antonio  Alarcón-ParedesAntonio Alarcón-Paredes1Cornelio  Yáñez-MárquezCornelio Yáñez-Márquez1,2*
  • 1Centro de Investigación en Computación, Instituto Politécnico Nacional (CIC-IPN), Mexico City, México, Mexico
  • 2National Polytechnic Institute (IPN), Mexico City, Mexico

The final, formatted version of the article will be published soon.

Classification is a central task in machine learning, underpinning applications in domains such as finance, medicine, engineering, information technology and biology. However, machine learning pattern classification can become a complicated or totally inexplicable task for current robust models due to the complexity that objective datasets can have, which is why there is a high interest in achieving a good performance classification. On the other hand, there is a need in particular cases to achieve such performance but maintaining a certain explainability in the operation and decisions of classification algorithms, which can become complex to have. For this reason, an algorithm is proposed that is robust but simple and highly explainable in datasets mainly related to medicine and with complexity in class imbalance. The main contribution of this research is a novel approach to a new machine learning classification algorithm based on binary string similarity, which is competitive but simple, interpretable and transparent, since it is known why a pattern is classified within a certain class. Therefore, a comparative study of the performance of the best-known state-of-the-art classification algorithms and the proposed model is presented. The experimental results demonstrate the benefits of the proposal of this research work, which were validated with the purpose of finding significant differences in performance using statistical hypothesis tests.

Keywords: Binary similarity, Classification algorithms, machine learning, medicine dataset, Pattern Classification, pattern recognition

Received: 13 Apr 2025; Accepted: 11 Dec 2025.

Copyright: © 2025 Velázquez-González, Alarcón-Paredes and Yáñez-Márquez. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Cornelio Yáñez-Márquez

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.