Stenly Tirta Wijaya
Unknown Affiliation

Articles

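PENGEMBANGAN SISTEM AGREGATOR BERITA BAHASA INDONESIA MENGGUNAKAN CONTENT EXTRACTION DAN HIERARCHICAL AGGLOMERATIVE CLUSTERING [Development of an Indonesian-Language News Aggregator System Using Content Extraction and Hierarchical Agglomerative Clustering]
Stenly Tirta Wijaya; Viny Christanti Mawardi; Janson Hendryli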
Jurnal Ilmu Komputer dan Sistem Informasi, Vol 4, No 2 (2016)
Publisher : Fakultas Teknologi Informasi Universitas Tarumanagara

DOI: 10.24912/jiksi.v4i2.129

Abstract

The main focus of this study is to develop a system that aggregates Indonesian online news articles and automatically clusters them by topic. The system uses content extraction to obtain the main content of each article, and Hierarchical Agglomerative Clustering with the Dice Similarity Coefficient as the similarity measure to group articles by topic. To determine the cutting point, we cut the dendrogram where the gap between two successive combination similarities is largest. Additionally, we add thresholds that limit the cutting area in order to improve the clustering result. We use the Standard Boolean Model for the search feature and the Silhouette coefficient to evaluate the clustering results. Tests on 998 articles show that limiting the cutting area with thresholds of 0.1 and 0.5 produces the highest average silhouette value, 0.264.
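
The clustering and cutting steps described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. Several details are assumptions made only for illustration: articles are represented as plain term sets, single linkage is used (the abstract does not name a linkage), and "limiting the cutting area" is read as considering only gaps whose two merge similarities both lie within [0.1, 0.5]. The names `dice`, `agglomerate`, `choose_cut` and the toy `docs` are hypothetical.

```python
from itertools import combinations


def dice(a, b):
    """Dice Similarity Coefficient: 2|A & B| / (|A| + |B|)."""
    total = len(a) + len(b)
    return 2 * len(a & b) / total if total else 0.0


def agglomerate(docs):
    """Naive single-link HAC over term sets (linkage choice is an assumption).

    Returns (sims, partitions): sims[k] is the combination similarity of
    merge k, and partitions[k] is the clustering after k merges, so
    partitions[0] holds the singletons.
    """
    clusters = [frozenset([i]) for i in range(len(docs))]
    sims, partitions = [], [list(clusters)]
    while len(clusters) > 1:
        # Single link: cluster similarity is the best pairwise document similarity.
        best_sim, x, y = max(
            (max(dice(docs[i], docs[j]) for i in c1 for j in c2), p, q)
            for (p, c1), (q, c2) in combinations(enumerate(clusters), 2)
        )
        merged = clusters[x] | clusters[y]
        clusters = [c for k, c in enumerate(clusters) if k not in (x, y)] + [merged]
        sims.append(best_sim)
        partitions.append(list(clusters))
    return sims, partitions


def choose_cut(sims, low=0.1, high=0.5):
    """Return how many merges to keep.

    Cuts between the two successive merges with the largest similarity gap,
    looking only at gaps whose two ends both lie in [low, high] (assumed
    reading of the threshold). If no gap qualifies, every merge is kept.
    """
    best_gap, keep = -1.0, len(sims)
    for k in range(len(sims) - 1):
        upper, lower = sims[k], sims[k + 1]
        if low <= lower and upper <= high and upper - lower > best_gap:
            best_gap, keep = upper - lower, k + 1
    return keep


# Toy term sets standing in for extracted article content (made-up data).
docs = [
    {"bank", "rate", "loan", "credit"},
    {"bank", "rate", "loan", "rupiah"},
    {"bank", "rate", "tax", "budget"},
    {"match", "goal", "league", "coach"},
    {"match", "goal", "league", "striker"},
    {"bank", "match", "sponsor", "deal"},
]

sims, partitions = agglomerate(docs)
keep = choose_cut(sims)
print(sims)
print([sorted(c) for c in partitions[keep]])
```

On this toy input the merge similarities fall from 0.75 to 0.5 to 0.25, and the only non-zero gap inside the assumed cutting area lies between 0.5 and 0.25, so the dendrogram is cut after the third merge and three clusters remain. A production system would likely use weighted term vectors and a more efficient clustering routine than this quadratic-per-merge loop.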