Garuda - Garba Rujukan Digital

P-Index from 2019 - 2024: 1.347

This author has published in these journals:

IJCCS (Indonesian Journal of Computing and Cybernetics Systems)
KLIK (Kumpulan jurnaL Ilmu Komputer) (e-Journal)
Indonesian Journal of Information System
PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND OFFICIAL STATISTICS

Lya Hulliyyatus Suadaa

Politeknik Statistika STIS

Author-ID : 2899463

Computer Science & IT; Control & Systems Engineering

Published: 8 Documents

Articles


PENGUKURAN TINGKAT KEMIRIPAN DOKUMEN BERBASIS CLUSTER (Cluster-Based Measurement of Document Similarity)
Ibnu Santoso; Lya Hulliyyatus Suadaa
KLIK - KUMPULAN JURNAL ILMU KOMPUTER Vol 6, No 1 (2019)
Publisher: Lambung Mangkurat University

DOI: 10.20527/klik.v6i1.181

Document similarity can be measured and used to find similar documents in a document collection (corpus). In a small corpus, measuring document similarity is not a problem. In a larger corpus, comparing a document against every other document can be time-consuming. A clustering method can reduce the number of documents that must be compared with a given document and so save time. This research aims to determine the effect of clustering on document similarity measurement and to evaluate its performance. The corpus consisted of 2,049 undergraduate theses written by Politeknik Statistika STIS students from 2007 to 2016. The documents were represented with a bag-of-words model and clustered using the k-means method. Similarity was measured with cosine similarity. In the simulation, clustering into 3 clusters required a longer preparation time (17.32%) but yielded faster query processing (77.88%), with an accuracy of 0.98. Clustering into 5 clusters required a longer preparation time (31.10%) but yielded faster query processing (83.79%), with an accuracy of 0.86. Clustering into 7 clusters required a longer preparation time (45.10%) but yielded faster query processing (85.30%), with an accuracy of 0.98.
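The pipeline the abstract describes (bag-of-words vectors, k-means clustering, then a cosine-similarity search restricted to one cluster) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the four-document corpus, the function names, the deterministic "first k documents" initialisation, and the use of cosine similarity (rather than Euclidean distance) when assigning documents to centroids are all assumptions made to keep the example short and dependency-free.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Represent a text as term counts (whitespace tokenisation, an assumption)."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term-weight mappings."""
    dot = sum(a[t] * b.get(t, 0) for t in a)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mean_vector(vectors):
    """Centroid of a non-empty list of sparse vectors."""
    total = Counter()
    for v in vectors:
        total.update(v)
    return {t: c / len(vectors) for t, c in total.items()}


def kmeans(vectors, k, iterations=10):
    """Simplified k-means: assign each document to its most similar centroid.

    Assumption: centroids start as the first k documents so the run is
    deterministic; a real setup would normally use random or k-means++ seeds.
    """
    centroids = [dict(v) for v in vectors[:k]]
    labels = [0] * len(vectors)
    for _ in range(iterations):
        labels = [
            max(range(k), key=lambda c: cosine(v, centroids[c])) for v in vectors
        ]
        for c in range(k):
            members = [v for v, lab in zip(vectors, labels) if lab == c]
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[c] = mean_vector(members)
    return centroids, labels


def most_similar(query, vectors, centroids, labels):
    """Search only the cluster whose centroid is closest to the query."""
    q = bag_of_words(query)
    best = max(range(len(centroids)), key=lambda c: cosine(q, centroids[c]))
    candidates = [i for i, lab in enumerate(labels) if lab == best]
    if not candidates:  # fall back to the whole corpus
        candidates = list(range(len(vectors)))
    return max(candidates, key=lambda i: cosine(q, vectors[i]))


# Hypothetical toy corpus, standing in for the thesis collection.
docs = [
    "regression model for poverty data",
    "neural network for text classification",
    "poverty regression analysis of survey data",
    "text classification with neural network model",
]
vectors = [bag_of_words(d) for d in docs]
centroids, labels = kmeans(vectors, k=2)
match = most_similar("poverty survey regression", vectors, centroids, labels)
print(labels, docs[match])
```

The trade-off reported in the abstract shows up in the structure of the sketch: `kmeans` adds work before any query is answered, while `most_similar` compares the query against the k centroids and then only against the members of one cluster instead of the whole corpus. Restricting the search this way can miss the true nearest document when it falls in a different cluster, which is one plausible reason accuracy can drop below 1.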

Co-Authors Amanda Tabitha Bulan Panjaitan Cynthia As Bahri Hana Raihanatul Jannah Ibnu Santoso Iftitah Athiyyah Rahma Indah Simbolon Muhammad Farhan Nicholas H Manurung Nur Ainun Daulay Renata De La Rosa Manik Rifqi Ramadhan Rizka Maulida Yanti Sukma Andini
