ORIGINAL RESEARCH article

Front. Robot. AI

Sec. Robot Vision and Artificial Perception

Volume 12 - 2025 | doi: 10.3389/frobt.2025.1552068

This article is part of the Research TopicVision AI in Robotic Perception and MappingView all articles

Task-specific CNN size reduction through content-specific pruning

Provisionally accepted
Nurbek  KonyrbaevNurbek Konyrbaev1*Martin  LukacMartin Lukac2Askhat  DiveevAskhat Diveev3Elena  SofronovaElena Sofronova3Sabit  IbadullaSabit Ibadulla1Asem  GalymzhankyzyAsem Galymzhankyzy1
  • 1Korkyt Ata Kyzylorda State University, Kyzylorda, Kazakhstan
  • 2Hiroshima City University, Hiroshima, Japan
  • 3Russian Academy of Sciences (RAS), Moscow, Moscow Oblast, Russia

The final, formatted version of the article will be published soon.

The widespread and growing use of flying unmanned autonomous vehicles (UAV) is attributed to their high spatial mobility, autonomous control, and lower cost when compared to usual manned flying vehicles. Applications such as surveying, searching, or scanning the environment with application-specific sensors have made extensive use of UAVs in fields like agriculture, geography, forestry, or biology. However, due to the large number of applications and types of UAVs, limited power resources have to be taken into account when designing task-specific software for a target UAV. The power constraints of, in particular, smaller UAVs will in general hinder their mobility, limit the sensors, or limit their distance from the control transmitter. These constraints will in general require that the UAV have a high level of autonomy. Reducing the overhead of the control and decision-making software onboard is one approach to increasing the autonomy of UAVs. Specifically, we can optimize and minimize the task-specific design of onboard control software more effectively than a general-purpose algorithm. In this work, we focus on reducing the size of the computer vision object classification algorithm. We define different tasks by specifying which objects the UAV must recognize, and we construct a convolutional neural network (CNN) for such a specific classification. However, rather than creating a custom CNN that requires its dataset, we begin with a pre-trained general-purpose classifier. We then choose specific groups of objects to recognize, and by using response-based pruning (RBP), we simplify the general-purpose CNN to fit our specific needs. We evaluate the pruned models in various scenarios. The results indicate that the evaluated task-specific pruning can reduce the size of the neural model and increase the accuracy of the classification tasks. For small UAVs intended for tasks with reduced visual content, the proposed method solves both the size reduction and individual model training problems.

Keywords: machine learning, Computer Vision, Classification, pruning, Noisy data

Received: 27 Dec 2024; Accepted: 19 May 2025.

Copyright: © 2025 Konyrbaev, Lukac, Diveev, Sofronova, Ibadulla and Galymzhankyzy. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Nurbek Konyrbaev, Korkyt Ata Kyzylorda State University, Kyzylorda, Kazakhstan

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.