Garuda - Garba Rujukan Digital

KLIK: Kajian Ilmiah Informatika dan Komputer

Vol. 3 No. 5 (2023): April 2023

Riska Aryanti (Universitas Bina Sarana Informatika, Jakarta)
Titik Misriati (Universitas Bina Sarana Informatika, Jakarta)
Rahmat Hidayat (Universitas Bina Sarana Informatika, Jakarta)

Publish Date
30 Apr 2023

Data imbalance is a common problem in classification, including in maternal health risk classification. Data imbalance occurs when the number of samples in the positive class is much less than the negative class. Data imbalance can cause the classification model to be inaccurate and tend to predict the majority class. One way to overcome the problem of data imbalance is to use the random oversampling technique. In this study, the random oversampling method is applied to overcome the problem of data imbalance in the classification of maternal health risks. Particle swarm optimization (PSO) is used for attribute weighting, improving the results of random oversampling and model performance. The results show that random oversampling can improve accuracy and reduce errors in predicting minority classes. In addition, the PSO technique also significantly contributed to improving the model's accuracy. The results of testing the random forest algorithm using 10-fold cross-validation on the health risks of pregnant women have an accuracy of 80.77%. After going through the random oversampling technique, the accuracy rate reaches 81.86%, and after optimization using the PSO technique, there is an increase of 2.15%, so the accuracy rate reaches 82.92%.

Citation Download

EndNote, Reference Manager, ProCite

Latex, Jabref

Check in Google Scholar

Journal Info

KLIK: Kajian Ilmiah Informatika dan Komputer

Website

Abbrev

klik

Publisher

Sekolah Tinggi Manajemen Informatika dan Komputer Budi Darma

Subject

Computer Science & IT

Description

Topik utama yang diterbitkan mencakup: 1. Teknik Informatika 2. Sistem Informasi 3. Sistem Pendukung Keputusan 4. Sistem Pakar 5. Kecerdasan Buatan 6. Manajemen Informasi 7. Data Mining 8. Big Data 9. Jaringan Komputer 10. Dan lain-lain (topik lainnya yang berhubungan dengan Teknologi Informati dan ...

Article Info

Abstract

Klasifikasi Risiko Kesehatan Ibu Hamil Menggunakan Random Oversampling Untuk Mengatasi Ketidakseimbangan Data

Article Info

Abstract