Lusiyana Adetia Isadi
Fakultas Ilmu Komputer, Universitas Brawijaya

Temu Kembali Informasi Lintas Bahasa Dokumen Berita Bahasa Indonesia-Inggris menggunakan Metode BM25F (Cross-Language Information Retrieval of Indonesian-English News Documents Using the BM25F Method)
Lusiyana Adetia Isadi; Indriati Indriati; Putra Pandu Adikara
Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer Vol 4 No 4 (2020): April 2020
Publisher : Fakultas Ilmu Komputer (FILKOM), Universitas Brawijaya

Abstract

News is a source of information about events that is presented to the general public in various languages. A website usually allows users to search in only one language. This is a problem for users who want to find broader information more quickly in several languages at once. The problem can be overcome by developing a cross-language information retrieval system. Such a system saves time because it returns documents in two languages from a query entered in only one language. One method that can be used to develop the system is BM25F, which can return relevant documents and handle structured documents. The news fields used in training and testing are the title and the content. The data used in this study are 300 Indonesian news documents and 300 English news documents, which are used to test the boost values, the Indonesian queries, and the English queries. In boost value testing, the highest precision@k was obtained when the title boost is 5 and the content boost is 1. These boost values are then used for query testing. Query testing is evaluated with precision@k, and the highest value, 0.98, is obtained at k=5 in the Indonesian query test, which returned both Indonesian and English documents.
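
To make the scoring and evaluation concrete, the following is a minimal sketch of field-weighted BM25F ranking and precision@k, not the authors' implementation. It uses the title boost of 5 and content boost of 1 reported in the abstract. Everything else is an assumption made for illustration: the `k1` and `b` defaults, the smoothed IDF variant, the whitespace tokenizer, the function names, and the toy documents. Query translation, which a cross-language system also needs, is outside the scope of this sketch.

```python
import math
from collections import Counter

# Boosts taken from the abstract; K1 and B are common defaults (assumed, not from the source).
BOOSTS = {"title": 5.0, "content": 1.0}
K1 = 1.2
B = 0.75


def tokenize(text):
    """Naive lowercase whitespace tokenizer (an assumption for this sketch)."""
    return text.lower().split()


def bm25f_scores(query, docs, boosts=BOOSTS, k1=K1, b=B):
    """Return one BM25F score per document; `docs` is a non-empty list of field -> text dicts."""
    fields = list(boosts)
    tokens = [{f: tokenize(d.get(f, "")) for f in fields} for d in docs]
    n_docs = len(docs)
    # Average field length over the collection (fall back to 1.0 if a field is always empty).
    avg_len = {f: (sum(len(t[f]) for t in tokens) / n_docs) or 1.0 for f in fields}

    # Document frequency: number of documents containing the term in any field.
    df = Counter()
    for t in tokens:
        vocab = set()
        for f in fields:
            vocab.update(t[f])
        df.update(vocab)

    query_terms = set(tokenize(query))
    scores = []
    for t in tokens:
        counts = {f: Counter(t[f]) for f in fields}
        score = 0.0
        for term in query_terms:
            if df[term] == 0:
                continue
            # Combine length-normalized, boosted term frequencies across fields.
            weighted_tf = 0.0
            for f in fields:
                norm = 1.0 - b + b * len(t[f]) / avg_len[f]
                weighted_tf += boosts[f] * counts[f][term] / norm
            # Smoothed IDF that stays positive (one common variant, assumed here).
            idf = math.log(1.0 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Saturate the combined frequency once, after merging the fields.
            score += idf * weighted_tf / (k1 + weighted_tf)
        scores.append(score)
    return scores


def precision_at_k(ranked_ids, relevant_ids, k):
    """Fraction of the top-k ranked documents that are relevant."""
    return sum(1 for doc_id in ranked_ids[:k] if doc_id in relevant_ids) / k


# Toy documents, invented purely for illustration.
toy_docs = [
    {"title": "flood jakarta", "content": "heavy rain caused flood in the city"},
    {"title": "football match", "content": "the team won after flood of goals"},
    {"title": "election news", "content": "voters went to the polls"},
]
toy_scores = bm25f_scores("flood", toy_docs)
# Document indices ordered from highest to lowest score.
toy_ranking = sorted(range(len(toy_docs)), key=lambda i: toy_scores[i], reverse=True)
```

In this sketch a match in the title contributes five times as much to the combined term frequency as a match in the content, which is why the first toy document outranks the second for the query "flood". The saturation step is applied once per term after the fields are merged, which is what distinguishes BM25F from summing independent per-field BM25 scores.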