Garuda - Garba Rujukan Digital

Jurnal Rekayasa elektrika

Vol 14, No 1 (2018)

Amalia Amalia (Universitas Sumatera Utara)
Maya Silvi Lydia (Universitas Sumatera Utara)
Siti Dara Fadilla (Universitas Sumatera Utara)
Miftahul Huda (Universitas Sumatera Utara)

Publish Date
27 Apr 2018

Clustering is an unsupervised method to group multiple objects based on the similarity automatically. The quality of clustering accuracy is determined by the number of similar objects in a correct cluster group. The robust preprocessing process and the choice of cluster algorithm can increase the efficiency of clustering. The objective of this study is to observe the most suitable method to cluster document in Bahasa Indonesia. We performed tests on several cluster algorithms such as K-Means, K-Means++ and Agglomerative with various preprocessing stages and collected the accuracy of each algorithm. Clustering experiments were conducted on a corpus containing 100 documents in Bahasa Indonesia with a commonly used preprocessing scenario. Additionally, we also attach our preprocessing stages such as LSA function, TF-IDF function, and LSA / TF-IDF function. We tested various LSA dimension reductions values from 10% to 90%, and the result shows that the best percentage of reduction rates between 50%-80%. The result also indicates that K-Means++ algorithm produces better purity values than other algorithms.

Citation Download

EndNote, Reference Manager, ProCite

Latex, Jabref

Check in Google Scholar

Journal Info

Jurnal Rekayasa elektrika

Website

Abbrev

JRE

Publisher

Universitas Syiah Kuala

Subject

Computer Science & IT Control & Systems Engineering Electrical & Electronics Engineering Energy Engineering

Description

The journal publishes original papers in the field of electrical, computer and informatics engineering which covers, but not limited to, the following scope: Electronics: Electronic Materials, Microelectronic System, Design and Implementation of Application Specific Integrated Circuits (ASIC), VLSI ...

Article Info

Abstract

Perbandingan Metode Klaster dan Preprocessing Untuk Dokumen Berbahasa Indonesia

Article Info

Abstract