Scientific Journal of Informatics
Vol. 6 No. 1 (2021): Jurnal Ilmiah Informatika

Pengaruh Jumlah Record Dataset Terhadap Algoritma Klasifikasi Berdasarkan Data Customer Churn

Anita (Universitas Singaperbangsa Karawang)
Anggito Wicaksono (Universitas Singaperbangsa Karawang)
Tesa Nur Padilah (Universitas Singaperbangsa Karawang)



Article Info

Publish Date
29 Jun 2021

Abstract

Telecommunication is one of the fastest growing industrial sectors so that there are more telecommunication companies. This can create various threats if the company does not use the strategy properly. Customer churn refers to the level of customer reduction which is one of the threats to reducing the company's revenue. This is an important issue for developing companies to evaluate in order to reduce the potential for churn that occurs. The initial stage that needs to be done is to predict customers who have the potential to switch from the company, one of which is the data mining approach. Classification is a data mining technique that can predict the class of datasets with various existing classification algorithms. The purpose of this study is to identify the effect of the number of dataset records on several classification algorithms. This research was conducted based on the CRISP-DM method by applying three classification algorithms, namely Logistic Regression, Naïve Bayes, and Decision Tree C4.5. The results showed that the greater the number of records in the dataset, the higher the accuracy value will be obtained. In dataset-1, logistic regression is a better algorithm based on an accuracy value of 80.09%, while naïve Bayes is superior based on an AUC value of 0.733 and an execution time of 0.00798 seconds. In dataset-2, it is found that decision tree is an algorithm that is more suitable than logistic regression and naïve Bayes algorithms, with an accuracy of 91.9% and an AUC value of 0.846 which is included in the good classification criteria. However, in execution time, the naïve Bayes algorithm only takes a processing time of 0.00403 seconds.

Copyrights © 2021






Journal Info

Abbrev

JIMI

Publisher

Subject

Computer Science & IT

Description

Topics cover the following areas (but are not limited to): 1. Information Technology (IT) a. Software engineering b. Game c. Information Retrieval d. Computer network e. Telecommunication f. Internet g. Wireless technology h. Network security i. Multimedia technology j. Mobile Computing k. ...