Knowledge Engineering and Data Science
Vol 6, No 1 (2023)

K-Means Clustering and Multilayer Perceptron for Categorizing Student Business Groups

Miftahul Walid (Department of Informatics, Faculty of Engineering, Madura Islamic University, Indonesia)
Norfiah Lailatin Nispi Sahbaniya (Department of Informatics, Faculty of Engineering, Madura Islamic University, Indonesia)
Hozairi Hozairi (Department of Informatics, Faculty of Engineering, Madura Islamic University, Indonesia)
Fajar Baskoro (Department of Informatics, Faculty of Electrical Technology and Intelligent Informatics, Sepuluh Nopember Institute of Technology, Indonesia)
Arya Yudhi Wijaya (Department of Informatics, Faculty of Electrical Technology and Intelligent Informatics, Sepuluh Nopember Institute of Technology, Indonesia)



Article Info

Publish Date
18 Sep 2023

Abstract

The research conducted in this study was driven by the East Java provincial government's requirement to assess the transaction levels of the Student Business Group (KUS) in the SMA Double Track program. These transaction levels are a basis for allocating supplementary financial aid to each business group. The system's primary objective is to assist the provincial government of East Java in making well-informed choices pertaining to the distribution of supplementary capital to the KUS. The classification technique employed in this study is the multilayer perceptron. However, the K-Means Clustering method is utilised to generate target data due to the limited availability during the classification process, which involves dividing the transaction level attributes into three distinct groups: (0) low transactions, (1) medium transactions, and (2) high transactions. The clustering process encompasses three distinct features: (1) income, (2) spending, and (3) profit. These three traits will be utilized as input data throughout the categorization procedure. The classification procedure employing the Multilayer Perceptron technique involved processing a dataset including 1383 data points. The training data constituted 80% of the dataset, while the remaining 20% was allocated for testing. In order to evaluate the efficacy of the constructed model, the training error was assessed using K-Fold cross-validation, yielding an average accuracy score of 0.92. In the present study, the categorization technique yielded an accuracy of 0.96. This model aims to classify scenarios when the dataset lacks prior target data.

Copyrights © 2023






Journal Info

Abbrev

keds

Publisher

Subject

Computer Science & IT Engineering

Description

Knowledge Engineering and Data Science (2597-4637), KEDS, brings together researchers, industry practitioners, and potential users, to promote collaborations, exchange ideas and practices, discuss new opportunities, and investigate analytics frameworks on data-driven and knowledge base systems. ...