Scientific Journal of Informatics
Vol 11, No 1 (2024): February 2024

Indonesian News Text Summarization Using MBART Algorithm

Rahma Hayuning Astuti (Department of Computer Science, Universitas Dian Nuswantoro, Indonesia)
Muljono Muljono (Department of Computer Science, Universitas Dian Nuswantoro, Indonesia)
Sutriawan Sutriawan (Department of Computer Science, Universitas Muhammadiyah Bima, Indonesia)



Article Info

Publish Date
29 Feb 2024

Abstract

Purpose: Technology advancements have led to the production of a large amount of textual data. There are numerous locations where one can find textual information sources, including blogs, news portals, and websites. Kompas, BBC, Liputan 6, CNN, and other news portals are a few websites that offer news in Indonesian. The purpose of this study was to explore the effectiveness of using mBART in text summarization for Bahasa Indonesia.Methods: This study uses mBART, a transformer architecture, to perform fine-tuning to generate news article summaries in Bahasa Indonesia. Evaluation was conducted using the ROUGE method to assess the quality of the summaries produced.Results: Evaluation using the ROUGE metric showed better results, with ROUGE-1 of 35.94, ROUGE-2 of 16.43, and ROUGE-L of 29.91. However, the performance of the model is still not optimal compared to existing models in text summarization for another language.Novelty: The novelty of this research lies in the use of mBART for text summarization, specifically adapted for Bahasa Indonesia. In addition, the findings also contribute to understanding the challenges and opportunities of improving text summarization techniques in the Indonesian context.

Copyrights © 2024






Journal Info

Abbrev

SJI

Publisher

Subject

Computer Science & IT

Description

Scientific Journal of Informatics published by the Department of Computer Science, Semarang State University, a scientific journal of Information Systems and Information Technology which includes scholarly writings on pure research and applied research in the field of information systems and ...