Jurnal Teknologi dan Sistem Komputer
Volume 8, Issue 1, Year 2020 (January 2020)

Perbandingan penghitungan jarak pada k-nearest neighbour dalam klasifikasi data tekstual

Wahyono Wahyono (Department of Computer Science and Electronic, Universitas Gadjah Mada)
I Nyoman Prayana Trisna (Master of Computer Science, Universitas Gadjah Mada)
Sarah Lintang Sariwening (Master of Computer Science, Universitas Gadjah Mada)
Muhammad Fajar (Master of Computer Science, Universitas Gadjah Mada)
Danur Wijayanto (Master of Computer Science, Universitas Gadjah Mada)



Article Info

Publish Date
31 Jan 2020

Abstract

One algorithm to classify textual data in automatic organizing of documents application is KNN, by changing word representations into vectors. The distance calculation in the KNN algorithm becomes essential in measuring the closeness between data elements. This study compares four distance calculations commonly used in KNN, namely Euclidean, Chebyshev, Manhattan, and Minkowski. The dataset used data from Youtube Eminem’s comments which contain 448 data. This study showed that Euclidian and Minkowski on the KNN algorithm achieved the best result compared to Chebycev and Manhattan. The best results on KNN are obtained when the K value is 3.

Copyrights © 2020






Journal Info

Abbrev

JTSISKOM

Publisher

Subject

Computer Science & IT Electrical & Electronics Engineering

Description

Jurnal Teknologi dan Sistem Komputer (JTSiskom, e-ISSN: 2338-0403) adalah terbitan berkala online nasional yang diterbitkan oleh Departemen Teknik Sistem Komputer, Universitas Diponegoro, Indonesia. JTSiskom menyediakan media untuk mendiseminasikan hasil-hasil penelitian, pengembangan dan ...