Communications in Science and Technology
Vol 5 No 1 (2020)

Comparison of text-image fusion models for high school diploma certificate classification

Chandra Ramadhan Atmaja Perdana (Universitas Gadjah Mada)
Hanung Adi Nugroho (Universitas Gadjah Mada)
Igi Ardiyanto (Universitas Gadjah Mada)



Article Info

Publish Date
02 Jul 2020

Abstract

File scanned documents are commonly used in this digital era. Text and image extraction of scanned documents play an important role in acquiring information. A document may contain both texts and images. A combination of text-image classification has been previously investigated. The dataset used for those research works the text were digitally provided. In this research, we used a dataset of high school diploma certificate, which the text must be acquired using optical character recognition (OCR) method. There were two categories for this high school diploma certificate, each category has three classes. We used convolutional neural network for both text and image classifications. We then combined those two models by using adaptive fusion model and weight fusion model to find the best fusion model. We come into conclusion that the performance of weight fusion model which is 0.927 is better than that of adaptive fusion model with 0.892.

Copyrights © 2020






Journal Info

Abbrev

cst

Publisher

Subject

Engineering

Description

Communication in Science and Technology [p-ISSN 2502-9258 | e-ISSN 2502-9266] is an international open access journal devoted to various disciplines including social science, natural science, medicine, technology and engineering. CST publishes research articles, reviews and letters in all areas of ...