Your new experience awaits. Try the new design now and help us make it even better

ORIGINAL RESEARCH article

Front. Artif. Intell.

Sec. Machine Learning and Artificial Intelligence

Volume 8 - 2025 | doi: 10.3389/frai.2025.1612080

This article is part of the Research TopicDeep Learning for Computer Vision and Measurement SystemsView all 3 articles

Evaluation of Vision Transformers for the detection of fullness of garbage bins for efficient waste management

Provisionally accepted
Parakram  Singh TanwerParakram Singh Tanwer1SHISHIR  MAHESHWARISHISHIR MAHESHWARI2Sushree  BeheraSushree Behera1Amit  ChauhanAmit Chauhan3Sunil  Kumar TelagamsettiSunil Kumar Telagamsetti4*
  • 1International Institute of Information Technology, Bangalore, Bangalore, Karnataka, India
  • 2Motilal Nehru National Institute of Technology Allahabad, Allahabad, Uttar Pradesh, India
  • 3Thapar Institute of Engineering & Technology, Patiala, Punjab, India
  • 4University of Gävle, Gävle, Sweden

The final, formatted version of the article will be published soon.

Efficient waste management is crucial for urban environments to maintain cleanliness, reduce environmental impact, and optimize resource allocation. Traditional waste collection systems often rely on scheduled pickups or manual inspections, leading to inefficient resource utilization and potential overflow issues. This paper presents a novel approach to automate the detection of garbage container fullness from images using machine learning techniques. More specifically, we explore three transformer-based architectures, namely, vision transformer, Swin transformer, and pyramid vision transformer to classify input images of garbage bins as clean or dirty. Our experimental results on the publicly available Clean dirty containers in Montevideo dataset suggest that transformer-based architectures are effective in garbage fullness detection. Moreover, a comparison with existing methods reveals that the proposed approach using the vision transformer surpasses the state-of-the-art, achieving a 96.74% accuracy in detecting garbage container fullness.

Keywords: vision Transformer, Garbage classification, Pyramid vision transformer, Swin (Shifted window) UNetR(UNetTransformer), garbage fullness detection

Received: 15 Apr 2025; Accepted: 18 Aug 2025.

Copyright: © 2025 Tanwer, MAHESHWARI, Behera, Chauhan and Telagamsetti. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Sunil Kumar Telagamsetti, University of Gävle, Gävle, Sweden

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.