International Journal of Advances in Intelligent Informatics
Vol 6, No 2 (2020): July 2020

Hybrid deep neural network for Bangla automated image descriptor

Md Asifuzzaman Jishan (Department of Statistics, Technische Universität Dortmund)
Khan Raqib Mahmud (University of Liberal Arts Bangladesh)
Abul Kalam Al Azad (University of Liberal Arts Bangladesh)
Md Shahabub Alam (Department of Statistics, Technische Universität Dortmund)
Anif Minhaz Khan (Department of Statistics, Technische Universität Dortmund)



Article Info

Publish Date
12 Jul 2020

Abstract

Automated image to text generation is a computationally challenging computer vision task which requires sufficient comprehension of both syntactic and semantic meaning of an image to generate a meaningful description. Until recent times, it has been studied to a limited scope due to the lack of visual-descriptor dataset and functional models to capture intrinsic complexities involving features of an image. In this study, a novel dataset was constructed by generating Bangla textual descriptor from visual input, called Bangla Natural Language Image to Text (BNLIT), incorporating 100 classes with annotation. A deep neural network-based image captioning model was proposed to generate image description. The model employs Convolutional Neural Network (CNN) to classify the whole dataset, while Recurrent Neural Network (RNN) and Long Short-Term Memory (LSTM) capture the sequential semantic representation of text-based sentences and generate pertinent description based on the modular complexities of an image. When tested on the new dataset, the model accomplishes significant enhancement of centrality execution for image semantic recovery assignment. For the experiment of that task, we implemented a hybrid image captioning model, which achieved a remarkable result for a new self-made dataset, and that task was new for the Bangladesh perspective. In brief, the model provided benchmark precision in the characteristic Bangla syntax reconstruction and comprehensive numerical analysis of the model execution results on the dataset.

Copyrights © 2020






Journal Info

Abbrev

IJAIN

Publisher

Subject

Computer Science & IT

Description

International journal of advances in intelligent informatics (IJAIN) e-ISSN: 2442-6571 is a peer reviewed open-access journal published three times a year in English-language, provides scientists and engineers throughout the world for the exchange and dissemination of theoretical and ...