IT JOURNAL RESEARCH AND DEVELOPMENT
Vol. 8 No. 1 (2023)

Evaluate of Random Undersampling Method and Majority Weighted Minority Oversampling Technique in Resolve Imabalanced Dataset

Meida Cahyo Untoro (Institut Teknologi Sumatera)
Muhammad Asyroful Nur Maulana Yusuf (Institut Teknologi Sumatera)



Article Info

Publish Date
18 Aug 2023

Abstract

Classification builds a model that predicts labels from existing data. When the class labels are imbalanced, the model tends to misclassify, which results in poor classification performance, so the data must be balanced before modeling. This study handles the imbalance with two methods: Random Undersampling and the Majority Weighted Minority Oversampling Technique (MWMOTE). The goal is to determine whether Random Undersampling and MWMOTE work well on imbalanced datasets and to measure the performance and accuracy of the resulting models. The datasets are open-source datasets from Kaggle (Diabetes data, Bank Turnover data, Stroke data, and Credit Card data) with various class ratios. Models were built with the decision tree algorithm and evaluated with the confusion matrix, using the precision, recall, f-measure, and accuracy obtained with each balancing method. With Random Undersampling, precision was 76.28%, recall 76.74%, f-measure 76.48%, and accuracy 76.21%. With MWMOTE, precision was 86.04%, recall 87.30%, f-measure 86.66%, and accuracy 86.61%. It can be concluded that MWMOTE is better than Random Undersampling, because its average confusion-matrix evaluation scores are higher on every metric.
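The two building blocks named in the abstract, random undersampling and confusion-matrix metrics, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `random_undersample` and `binary_metrics`, the toy data, and the choice to score a single positive class are assumptions made for illustration only. MWMOTE is more involved and is not sketched here.

```python
import random
from collections import Counter


def random_undersample(rows, labels, seed=0):
    """Randomly drop rows from larger classes until every class
    has as many rows as the smallest class (hypothetical helper)."""
    rng = random.Random(seed)
    by_class = {}
    for row, label in zip(rows, labels):
        by_class.setdefault(label, []).append(row)
    target = min(len(members) for members in by_class.values())
    out_rows, out_labels = [], []
    for label, members in by_class.items():
        # sample without replacement so no row is duplicated
        for row in rng.sample(members, target):
            out_rows.append(row)
            out_labels.append(label)
    return out_rows, out_labels


def binary_metrics(y_true, y_pred, positive=1):
    """Precision, recall, f-measure, and accuracy from the four
    confusion-matrix counts of a binary problem (hypothetical helper)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    denom = precision + recall
    f_measure = 2 * precision * recall / denom if denom else 0.0
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return {"precision": precision, "recall": recall,
            "f_measure": f_measure, "accuracy": accuracy}


# Toy imbalanced data (assumed for illustration): 90 negatives, 10 positives.
rows = [[i] for i in range(100)]
labels = [0] * 90 + [1] * 10
balanced_rows, balanced_labels = random_undersample(rows, labels, seed=42)
class_counts = Counter(balanced_labels)  # both classes now have 10 rows

# Toy predictions (assumed): 3 true positives, 1 false positive,
# 1 false negative, 3 true negatives.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]
scores = binary_metrics(y_true, y_pred)
```

Undersampling discards majority-class rows, so the balanced set here keeps only 20 of the original 100 rows. That loss of information is one plausible reason an oversampling method could score higher, although the abstract itself does not state the cause.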

Copyrights © 2023






Journal Info

Abbrev

ITJRD

Publisher

Subject

Computer Science & IT, Control & Systems Engineering, Engineering

Description

Information Technology Journal Research and Development (ITJRD) is a scientific journal established by the Informatics Engineering Study Program (Prodi Teknik Informatika), Universitas Islam Riau, to give academics and researchers a venue for publishing papers and scientific work in the field of Information Technology. As for ...