International Journal of Computing Science and Applied Mathematics
Vol 6, No 1 (2020)

Handling Imbalance Data in Classification Model with Nominal Predictors

Kartika Fithriasari (Department of Statistics, Institut Teknologi Sepuluh Nopember)
Iswari Hariastuti (the National Family Planning Coordinating Board (BKKBN), East Java, Indonesia)
Kinanthi Sukma Wening (Department of Statistics, Institut Teknologi Sepuluh Nopember)



Article Info

Publish Date
21 Feb 2020

Abstract

Decision tree, one of classification method, can be done to find out the factors that predict something with interpretable result. However, a small and unbalanced percentage will make the classification only lead to the majority class. Therefore, handling imbalance class needs to be done. One method that often used in nominal predictor data is SMOTE-N. For accuracy improving, a hybrid SMOTE-N and ADASYN-N was developed. SMOTE-N-ENN and ADASYN-N were developed for accuracy improvement. In this study, SMOTE-N, SMOTE-N-ENN and ADASYN-N will be compared in handling imbalance class in the classification of premarital sex among adolescent using base class CART. The conclusion obtained regarding the best method for handling class imbalance is ADASYN-N because it provides the highest AUC compared to SMOTE-N and SMOTE-N-ENN. The best decision tree provides information that factors that can predict adolescents having premarital sexual relations are dating style, knowledge of the fertile period, knowledge of the risk of young marriage, gender, recent education, and area of residence.

Copyrights © 2020






Journal Info

Abbrev

ijcsam

Publisher

Subject

Computer Science & IT Education Mathematics

Description

(IJCSAM) International Journal of Computing Science and Applied Mathematics is an open access journal publishing advanced results in the fields of computations, science and applied mathematics, as mentioned explicitly in the scope of the journal. The journal is geared towards dissemination of ...