Eksponensial
Vol 12 No 2 (2021)

Optimalisasi K-Means Cluster dengan Principal Component Analysis pada Pengelompokan Kabupaten/Kota di Pulau Kalimantan Berdasarkan Indikator Tingkat Pengangguran Terbuka

Muhammad Rais (Laboratorium Statistika Komputasi FMIPA Universitas Mulawarman)
Rito Goejantoro (Laboratorium Statistika Komputasi FMIPA Universitas Mulawarman)
Surya Prangga (Laboratorium Statistika Komputasi FMIPA Universitas Mulawarman)



Article Info

Publish Date
30 Dec 2021

Abstract

Data mining or often also called knowledge discovery in databases is an activity that includes collecting, using historical data to find regularity, patterns, or relationships in large data sets resulting in useful new information. Cluster analysis is an analysis that aims to group data based on its likeness. This research uses the K-Means method combined with PCA. The K-Means method groups data in the form of one or more clusters that share the same characteristics. While the PCA method was used to reduce research variables. This grouping method was applied to the data indicator of the unemployment rate of districts/cities in Kalimantan Island in 2018. The cluster validation used in this study was the Davies-Bouldin Index (DBI). Based on the results of the analysis, it was concluded that the number of principal components formed was as many as 2 principal components. The most optimal grouping of districts/cities in Kalimantan island in 2018 was to use 2 clusters with a DBI value of 0,507. The grouping of districts/cities in Kalimantan Island in 2018 produced 2 clusters, cluster 1 consisting of 51 districts/cities and clusters of 2 consisting of 5 districts/cities. Cluster 1 was a cluster that has the highest percentage of the poor population and the highest labor force participation rate when compared to cluster 2. While cluster 2 was a cluster that has an index value of human development, population, number of the labor force, number of unemployed, population density, and the minimum wage of district/city was high compared to cluster 1.

Copyrights © 2021






Journal Info

Abbrev

exponensial

Publisher

Subject

Computer Science & IT Decision Sciences, Operations Research & Management Economics, Econometrics & Finance Mathematics Other

Description

Jurnal Eksponensial is a scientific journal that publishes articles of statistics and its application. This journal This journal is intended for researchers and readers who are interested of statistics and its ...