Jurnal Teknik Industri
Vol. 8 No. 1 (2006): JUNE 2006

HARD: SUBJECT-BASED SEARCH ENGINE USING TF-IDF AND JACCARD'S COEFFICIENT

Rolly Intan (Faculty of Industrial Technology, Petra Christian University)
Andrew Defeng (Faculty of Industrial Technology, Petra Christian University)



Article Info

Publish Date
11 Oct 2006

Abstract

This paper proposes a hybrid search engine concept based on the subject parameter of High Accuracy Retrieval from Documents (HARD). Tf-Idf and Jaccard's Coefficient are modified and extended to support the concept. Several illustrative examples, including their calculation steps, are given to make the proposed concept and formulas clear.

Abstract translated from Bahasa Indonesia: This paper introduces a search engine algorithm based on the HARD (High Accuracy Retrieval from Documents) concept, combining the TF-IDF (Term Frequency Inverse Document Frequency) method with Jaccard's Coefficient. Both methods are modified and developed further by introducing several new formulas. To make the algorithm and the new formulas easier to understand, several worked calculation examples are given.

Keywords: HARD, Tf-Idf, Jaccard's coefficient, search engine, fuzzy sets.
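For readers unfamiliar with the two measures named in the abstract, the following is a minimal sketch of their standard textbook forms. It does not reproduce the paper's modified or extended formulas, which are not given on this page. The function names `tf_idf` and `jaccard`, the weighting choice (relative term frequency times log(N / df)), and the three sample documents are assumptions made for illustration only.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Textbook TF-IDF (an assumption, not the paper's modified formula).

    tf  = term count / document length
    idf = log(N / df), where df is the number of documents containing the term
    """
    n = len(docs)
    # Document frequency: count each term once per document.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append(
            {t: (c / len(doc)) * math.log(n / df[t]) for t, c in counts.items()}
        )
    return weights


def jaccard(a, b):
    """Standard Jaccard coefficient: |A intersect B| / |A union B|."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Hypothetical tokenized documents, invented for this example.
docs = [
    ["fuzzy", "search", "engine"],
    ["search", "engine", "ranking"],
    ["fuzzy", "set", "theory"],
]

weights = tf_idf(docs)
similarity = jaccard(docs[0], docs[1])
```

In this sketch a term that occurs in only one document, such as "ranking", receives the highest weight, while the Jaccard coefficient of the first two documents depends only on the terms they share, not on how often those terms occur.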

Copyrights © 2006






Journal Info

Abbrev

ind

Publisher

Subject

Industrial & Manufacturing Engineering

Description

Jurnal Teknik Industri aims to: Promote a comprehensive approach to the application of industrial engineering in industries as well as incorporating viewpoints of different disciplines in industrial engineering. Strengthen academic exchange with other institutions. Encourage scientist, practicing ...