Jurnal Linguistik Komputasional
Vol 5 No 2 (2022): Vol. 5, No. 2

GAN-Based End to End Text-to-Speech System for Indonesian Language

Moch Azhar Dhiaulhaq (Institut Teknologi Bandung)
Rizki Rivai Ginanjar (Prosa.ai, PT Prosa Solusi Cerdas)
Dessi Puji Lestari (Institut Teknologi Bandung)



Article Info

Publish Date
28 Oct 2022

Abstract

The developments of the modern text-to-speech (TTS) technology have matured in which the direction of the recent approaches has moved toward the optimization of the system and TTS modeling from the resource-scarce languages, rather than finding new model architectures. In this paper, a novel approach to modeling modern end-to-end (E2E) TTS for Indonesian language with the integration of three different generative adversarial networks (GAN)-based vocoders for comparison is proposed. Based on the evaluation, the proposed system shows promising results with the mean opinion score (MOS) value of 4.60 while still maintaining fast inference speed, proven by the real-time factor (RTF) value under one.

Copyrights © 2022






Journal Info

Abbrev

jlk

Publisher

Subject

Computer Science & IT

Description

Jurnal Linguistik Komputasional (JLK) menerbitkan makalah orisinil di bidang lingustik komputasional yang mencakup, namun tidak terbatas pada : Phonology, Morphology, Chunking/Shallow Parsing, Parsing/Grammatical Formalisms, Semantic Processing, Lexical Semantics, Ontology, Linguistic Resources, ...