CommIT (Communication & Information Technology)
Vol 11, No 1 (2017): CommIT Vol. 11 No. 1 Tahun 2017

The Performance of Boolean Retrieval and Vector Space Model in Textual Information Retrieval

Yulianto, Budi (Unknown)
Budiharto, Widodo (Unknown)
Kartowisastro, Iman Herwidiana (Unknown)



Article Info

Publish Date
01 Aug 2017

Abstract

Boolean Retrieval (BR) and Vector Space Model (VSM) are very popular methods in information retrieval for creating an inverted index and querying terms. BR method searches the exact results of the textual information retrieval without ranking the results. VSM method searches and ranks the results. This study empirically compares the two methods. The research utilizes a sample of the corpus data obtained from Reuters. The experimental results show that the required times to produce an inverted index by the two methods are nearly the same. However, a difference exists on the querying index. The results also show that the numberof generated indexes, the sizes of the generated files, and the duration of reading and searching an index are proportional with the file number in the corpus and thefile size.

Copyrights © 2017






Journal Info

Abbrev

COMMIT

Publisher

Subject

Computer Science & IT

Description

Journal of Communication and Information Technology (CommIT) focuses on various issues spanning: software engineering, mobile technology and applications, robotics, database system, information engineering, artificial intelligent, interactive multimedia, computer networking, information system ...