Journal of Electronics, Electromedical Engineering, and Medical Informatics
Vol 5 No 4 (2023): October

Feature Selection Using Firefly Algorithm With Tree-Based Classification In Software Defect Prediction

Vina Maulida (-)
Rudy Herteno (Fakultas Matematika dan Ilmu Pengetahuan Alam , Universitas Lambung Mangkurat , Indonesia)
Dwi Kartini (Fakultas Matematika dan Ilmu Pengetahuan Alam , Universitas Lambung Mangkurat , Indonesia)
Friska Abadi (Fakultas Matematika dan Ilmu Pengetahuan Alam , Universitas Lambung Mangkurat , Indonesia)
Mohammad Reza Faisal (Fakultas Matematika dan Ilmu Pengetahuan Alam , Universitas Lambung Mangkurat , Indonesia)



Article Info

Publish Date
11 Aug 2023

Abstract

Defects that occur in software products are a universal occurrence. Software defect prediction is usually carried out to determine the performance, accuracy, precision and performance of the prediction model or method used in research, using various kinds of datasets. Software defect prediction is one of the Software Engineering studies that is of great concern to researchers. This research was conducted to determine the performance of tree-based classification algorithms including Decision Trees, Random Forests and Deep Forests without using feature selection and using firefly feature selection. And also know the tree-based classification algorithm with firefly feature selection which can provide better software defect prediction performance. The dataset used in this study is the ReLink dataset which consists of Apache, Safe and Zxing. Then the data is divided into testing data and training data with 10-fold cross validation. Then feature selection is performed using the Firefly Algorithm. Each ReLink dataset will be processed by each tree-based classification algorithm, namely Decision Tree, Random Forest and Deep Forest according to the results of the firefly feature selection. Performance evaluation uses the AUC value (Area under the ROC Curve). Research was conducted using google collab and the average AUC value generated by Firefly-Decision Tree is 0.66, the average AUC value generated by Firefly-Random Forest is 0.77, and the average AUC value generated by Firefly-Deep Forest is 0, 76. The results of this study indicate that the approach using the Firefly algorithm with Random Forest classification can work better in predicting software damage compared to other tree-based algorithms. In previous studies, tree-based classification with hyperparameter tuning on software defect prediction datasets obtained quite good results. In another study, the classification performance of SVM, Naïve Bayes and K-nearest neighbor with firefly feature selection resulted in improved performance. Therefore, this research was conducted to determine the performance of a tree-based algorithm using the firefly selection feature.

Copyrights © 2023






Journal Info

Abbrev

jeeemi

Publisher

Subject

Computer Science & IT Control & Systems Engineering Electrical & Electronics Engineering Engineering

Description

The Journal of Electronics, Electromedical Engineering, and Medical Informatics (JEEEMI) is a peer-reviewed open-access journal. The journal invites scientists and engineers throughout the world to exchange and disseminate theoretical and practice-oriented topics which covers three (3) majors areas ...