Building of Informatics, Technology and Science
Vol 4 No 3 (2022): Desember 2022

Pengaruh Data Preprocessing terhadap Imbalanced Dataset pada Klasifikasi Citra Sampah menggunakan Algoritma Convolutional Neural Network

Muhammad Resa Arif Yudianto (Universitas Muhammadiyah Magelang, Magelang)
Pristi Sukmasetya (Universitas Muhammadiyah Magelang, Magelang)
Rofi Abul Hasani (Universitas Muhammadiyah Magelang, Magelang)
Dimas Sasongko (Universitas Muhammadiyah Magelang, Magelang)



Article Info

Publish Date
26 Dec 2022

Abstract

Garbage is one of Indonesia's most significant problems with an increase in waste each year reaching 187.2 million tonnes/year. Various efforts to reduce the amount of waste such as Garbage Banks have been encouraged. However, this program has not run well, because some people have difficulty distinguishing the type of waste. One solution to overcome this problem is that need a system that can classify the type of waste. The deep learning approach with the CNN algorithm is currently widely used to solve classification problems. This method requires a large number of datasets to increase the level of accuracy. Getting a garbage dataset is a particular problem in the training process because the dataset is unbalanced. The dataset used amounted to 2527 data consisting of 6 classes. Several treatments such as undersampling and image augmentation are applied to overcome imbalanced datasets. Other treatments such as the type of input image channel and the use of filters are combined into 24 experimental scenarios to achieve the highest accuracy. The results of the experiment get the best scenario, namely, the dataset is undersampling and then augmented with 5 geometric transformation parameters with the input image being RGB and applying a sharpening filter to get an accuracy value of 0.9919 with 20 epochs.

Copyrights © 2022






Journal Info

Abbrev

bits

Publisher

Subject

Computer Science & IT

Description

Building of Informatics, Technology and Science (BITS) is an open access media in publishing scientific articles that contain the results of research in information technology and computers. Paper that enters this journal will be checked for plagiarism and peer-rewiew first to maintain its quality. ...