STATISTIKA
Vol. 23 No. 1 (2023): Statistika

Perbandingan Performa Metode Berbasis Support Vector Machine untuk Penanganan Klasifikasi Multi Kelas Tidak Seimbang

Qorry Meidianingsih (Universitas Negeri Jakarta)
Devi Eka Wardani (Universitas Negeri Jakarta)
Ellis Salsabila (Universitas Negeri Jakarta)
Lina Nafisah (Program Studi Statistika, Fakultas MIPA, Universitas Negeri Jakarta)
Afifah Nur Mutia (Program Studi Statistika, Fakultas MIPA, Universitas Negeri Jakarta)



Article Info

Publish Date
25 Jun 2023

Abstract

ABSTRAK Permasalahan data multi kelas tidak seimbang mulai mendapatkan perhatian dari komunitas peneliti dalam beberapa tahun terakhir. Permasalahan klasifikasi pada kasus multi kelas tidak seimbang menjadi lebih rumit karena sebagian besar teknik klasifikasi multi kelas diterapkan pada kondisi kelas yang seimbang, sedangkan dalam realisasinya data yang ditemukan lebih sering memiliki kelas tidak seimbang. Penelitian ini fokus pada membandingkan performa tiga metode klasifikasi berbasis support vector machine, yaitu SVM standar, SVM-SMOTE, dan granular support vector machines–repetitive undersampling (GSVM-RU) dimana metode dekomposisi one-versus-one (OVO) diterapkan. Terdapat tiga jenis data hasil bangkitan software R yang dirancang berdasarkan kombinasi jumlah kelas mayoritas dan minoritas yang mungkin terjadi. Hasil penelitian menunjukkan bahwa ketiga model klasifikasi menunjukkan tingkat akurasi tertinggi pada data simulasi yang memiliki perbandingan persentase antara jumlah amatan kelas mayoritas dan minoritasnya paling tinggi. Berdasarkan kriteria sensitivitas dan spesifisitas, model klasifikasi SVM standar dan SVM-SMOTE memberikan performa yang sama baiknya pada kelas mayoritas, sedangkan model klasifikasi GSVM-RU memiliki performa yang baik dalam mendeteksi kelas minoritas. ABSTRACT The problem of data with imbalances in multi-class has begun to receive attention from the research community in recent years. Classification problems in imbalanced multi-class cases become more complicated because most of the classification techniques in multi-class are applied to balanced class conditions, whereas in reality, the data found more often have unbalanced classes. This study focuses on comparing the performance of three support vector machine-based classification methods, namely standard SVM, SVM-SMOTE, and granular support vector machines–repetitive undersampling (GSVM-RU) where the one-versus-one (OVO) decomposition method is applied. There are three types of data generated by R software that are designed based on a combination of the number of possible majority and minority classes. The results showed that the three classification models showed the highest level of accuracy in the simulation data which had the highest percentage comparison between the number of observations of the majority and minority classes. Based on the sensitivity and specificity criteria, the standard SVM and SVM-SMOTE classification models provide equally good performance in the majority class, while the GSVM-RU classification model has good performance in detecting the minority class.

Copyrights © 2023






Journal Info

Abbrev

statistika

Publisher

Subject

Decision Sciences, Operations Research & Management Mathematics

Description

STATISTIKA published by Department of Statistics, Faculty of Mathematics and Natural Sciences, Bandung Islamic University as pouring media and discussion of scientific papers in the field of statistical science and its applications, both in the form of research results, discussion of theory, ...