JURNAL ILMIAH INFORMATIKA
Vol 10 No 01 (2022): Jurnal Ilmiah Informatika (JIF)

TEXT DOCUMENT SEGMENTATION USING THE TEXTTILING METHOD

Chintalya Magdalena (Universitas Kristen Indonesia)
Bangun Hyolister Tambun (Universitas Kristen Indonesia)



Article Info

Publish Date
01 Mar 2022

Abstract

This paper reports our work on text segmentation of Indonesian speech documents. Because the speech documents are transcribed by Automatic Speech Recognition (ASR), the resulting text contains no boundaries between documents, so it must be segmented by topic. We apply the TextTiling method with several term-weighting techniques (TF-IDF, TF-IDF-Mutual Information, TF-IDF-Mutual Information-Word Similarity, and TF-IDF-Word Frequency) to measure the similarity between segments. The results show that TF-IDF-Mutual Information performed better on most of the collections.
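As a rough illustration of the approach named in the abstract, the sketch below scores the gaps between adjacent text blocks by the cosine similarity of their TF-IDF vectors and places a boundary where the similarity curve dips deeply. It is not the authors' implementation: the function names, the smoothed IDF formula, the pre-tokenized input blocks, and the cutoff of mean minus half a standard deviation are all assumptions made for this example, and only plain TF-IDF weighting is shown, not the Mutual Information, Word Similarity, or Word Frequency variants.

```python
import math
import statistics
from collections import Counter


def tfidf_vectors(blocks):
    """Build one TF-IDF dict per block; each block is a list of tokens."""
    n = len(blocks)
    df = Counter()
    for block in blocks:
        df.update(set(block))  # document frequency: count each term once per block
    vectors = []
    for block in blocks:
        tf = Counter(block)
        # Smoothed IDF (an assumption of this sketch, not taken from the paper).
        vectors.append({t: c * (math.log((1 + n) / (1 + df[t])) + 1.0)
                        for t, c in tf.items()})
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def gap_scores(vectors):
    """Similarity across each gap between block i and block i + 1."""
    return [cosine(vectors[i], vectors[i + 1]) for i in range(len(vectors) - 1)]


def depth_scores(scores):
    """Depth of each valley: climb to the nearest peak on both sides."""
    depths = []
    for i, s in enumerate(scores):
        left = s
        for j in range(i - 1, -1, -1):
            if scores[j] < left:
                break
            left = scores[j]
        right = s
        for j in range(i + 1, len(scores)):
            if scores[j] < right:
                break
            right = scores[j]
        depths.append((left - s) + (right - s))
    return depths


def segment(blocks):
    """Return gap indices i where a topic boundary follows block i."""
    depths = depth_scores(gap_scores(tfidf_vectors(blocks)))
    if not depths:
        return []
    # Cutoff is an assumption: mean depth minus half the standard deviation.
    cutoff = statistics.mean(depths) - statistics.pstdev(depths) / 2
    return [i for i, d in enumerate(depths) if d > 0 and d > cutoff]
```

Calling `segment` on a list of token lists returns the indices of the gaps judged to be topic boundaries. Swapping in another weighting scheme would only require replacing `tfidf_vectors`, since the gap scoring and depth scoring operate on whatever vectors they are given.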

Copyrights © 2022






Journal Info

Abbrev

jif

Publisher

Subject

Computer Science & IT

Description

The Jurnal Teknologi Informatika dan Sistem Informasi of the Fakultas Teknik dan Komputer, UPB, publishes scientific articles on topics including Information System, Geographical Information System, Remote Sensing, Cryptography, Artificial Intelligence, Computer Network, Security, and ...