This Author published in this journals
All Journal Jurnal Komputasi
Permata Permata
Univeristas Teknokrat Indonesia

Published : 1 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search

Efek Peningkatan Jumlah Paralel Korpus Pada Penerjemahan Kalimat Bahasa Indonesia ke Bahasa Lampung Dialek Api Permata Permata; Zaenal Abidin; Farida Ariyani
Jurnal Komputasi Vol 8, No 2 (2020)
Publisher : Universitas Lampung

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.23960/komputasi.v8i2.2613

Abstract

Experimental observations of the effect of the number of parallel corpus on Indonesian translation into the Lampung dialect api were carried out using the statistical machine translation (SMT) method. SMT utilizes a parallel Indonesian corpus and its translation in the Lampung dialect api as a material for training data. The research strategy was carried out in three ways, namely first strategy with a corpus parallel number of 1000 sentences, the second strategy with a corpus parallel number of 2000 and the third strategy with a corpus parallel number of 3000 sentences. The research starts from the preprocessing phase followed by the training phase, namely the parallel corpus processing phase to obtain a language model and translation model. Then the testing phase, and ends with the evaluation phase. SMT testing uses 25 single sentences without out-of-vocabulary (OOV), 25 single sentences with OOV, 25 compound sentences without OOV and 25 compound sentences with OOV. The test results of translating Indonesian sentences intoLampung dialectic api are shown through the accuracy value of Bilingual Evaluation Undestudy (BLEU) obtained in testing 25 single sentences without out-of-vocabulary (OOV) in the first strategy, the second and the third are 21.49%, 59.58% and 73.21%. In testing 25 single sentences with out-of-vocabulary (OOV) obtained in the first strategy, the second and the third are 23.22%, 44.33% and 68.72%. In testing 25 compound sentences without out-of-vocabulary(OOV) obtained in the first strategy, the second and the third are 18.22%, 39.4% and 69.18%. In testing 25 compound sentences with out-of-vocabulary (OOV) obtained in the first strategy, the second and the third are 25.94%, 28.22% and 71.94%.