International Journal of Artificial Intelligence Research
Vol 4, No 2 (2020): December 2020

Random and Synthetic Over-Sampling Approach to Resolve Data Imbalance in Classification

Mardhiya Hayaty (Amikom Yogyakarta University)
Siti Muthmainah (Amikom Yogyakarta University)
Syed Muhammad Ghufran (Department of Mathematics Abdul Wali Khan University, Mardan Garden Campus)



Article Info

Publish Date
05 Dec 2020

Abstract

High accuracy value is one of the parameters of the success of classification in predicting classes. The higher the value, the more correct the class prediction.  One way to improve accuracy is dataset has a balanced class composition. It is complicated to ensure the dataset has a stable class, especially in rare cases. This study used a blood donor dataset; the classification process predicts donors are feasible and not feasible; in this case, the reward ratio is quite high. This work aims to increase the number of minority class data randomly and synthetically so that the amount of data in both classes is balanced. The application of SOS and ROS succeeded in increasing the accuracy of inappropriate class recognition from 12% to 100% in the KNN algorithm. In contrast, the naïve Bayes algorithm did not experience an increase before and after the balancing process, which was 89%. 

Copyrights © 2020






Journal Info

Abbrev

IJAIR

Publisher

Subject

Computer Science & IT Electrical & Electronics Engineering

Description

International Journal Of Artificial Intelligence Research (IJAIR) is a peer-reviewed open-access journal. The journal invites scientists and engineers throughout the world to exchange and disseminate theoretical and practice-oriented topics of Artificial intelligent Research which covers four (4) ...