Deep-Learning Based Tomato Leaf Disease Classification using CNN, ConvNeXt, Vision Transformer, and Swin Transformer
DOI:
https://doi.org/10.3126/injet.v3i2.95510Keywords:
Tomato leaf disease classification, deep learning, convolutional neural networks, ConvNeXt, Vision Transformer, Swin Transformer, deployment-aware AI, plant disease detection, agricultural AI, image classificationAbstract
Deep-learning-based tomato leaf disease classification remains challenging because it must address visually similar disease patterns, non-target inputs, and computational constraints. This paper presents a deployment-aware comparative study of four deep learning model families—Convolutional Neural Networks (CNN), ConvNeXt, Vision Transformer (ViT), and Swin Transformer—for tomato leaf disease classification. We used a dataset comprising 31,042 training images and 5,643 validation images and include a not_tomato rejection class to better reflect real-world deployment beyond closed-set classification. We also applied HSV-based leaf-focused background removal and evaluated model accuracy, loss behavior, precision, recall, and F1-score. Results showed that Swin Transformer achieved the best overall performance with 98.69% validation accuracy and superior plant detection capability. ViT demonstrated the most stable generalization behavior, while ConvNeXt remained competitive but computationally expensive. The custom CNN offered high efficiency but lower accuracy. These findings highlight the importance of balancing accuracy, generalization, and deployment constraints in agricultural AI systems.
Downloads
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 International Journal on Engineering Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.
This license enables reusers to distribute, remix, adapt, and build upon the material in any medium or format, so long as attribution is given to the creator. The license allows for commercial use.