Indonesian Journal on Computing (Indo-JC)
Vol. 4 No. 2 (2019): September, 2019

Named Entity Recognition for an Indonesian Based Language Tweet using Multinomial Naive Bayes Classifier

Ramadhyni Rifani (Telkom University)
Moch Arif Bijaksana (Telkom University)
Ibnu Asror (Telkom University)



Article Info

Publish Date
09 Sep 2019

Abstract

In Natural Language Processing (NLP), Named Entity Recognition (NER) is a widely researched subtask. The main task of NER is to identify and detect entity names among the words in a sentence. The data source used here is real-time Indonesian-language tweets, each limited to 280 characters. Words in these tweets can refer to an entity name, a location, or an organization, so determining the entity type requires first examining the word patterns around each word. In Indonesia, an account posts on average at least 1-3 tweets per day, and these tweets mix formal and informal content, which makes assigning the correct entity label a difficult challenge. In this research, we label the entities in Indonesian-language tweets using the Multinomial Naive Bayes classifier. The system uses precision, recall, and F-measure as evaluation metrics. The resulting entity classification reaches an F1 score of 80%.
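The approach summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the tag set (PER, LOC, ORG, O), the feature template (the word, its left and right neighbours, and a capitalization flag), the Laplace smoothing constant, and the toy tweets are all assumptions made for illustration. It shows a token-level multinomial Naive Bayes classifier that uses the surrounding words as features, together with per-label precision, recall, and F1.

```python
import math
from collections import Counter, defaultdict


def token_features(tokens, i):
    """Features for token i: the word, its neighbours, and a capitalization flag.

    This feature template is an assumption, not taken from the paper.
    """
    word = tokens[i]
    prev_word = tokens[i - 1].lower() if i > 0 else "<s>"
    next_word = tokens[i + 1].lower() if i < len(tokens) - 1 else "</s>"
    feats = ["w=" + word.lower(), "prev=" + prev_word, "next=" + next_word]
    if word[:1].isupper():
        feats.append("is_capitalized")
    return feats


class MultinomialNB:
    """Multinomial Naive Bayes over feature counts with Laplace smoothing."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # smoothing constant (assumed value)

    def fit(self, feature_lists, labels):
        self.label_counts = Counter(labels)
        self.feature_counts = defaultdict(Counter)
        self.vocab = set()
        for feats, label in zip(feature_lists, labels):
            self.feature_counts[label].update(feats)
            self.vocab.update(feats)
        return self

    def predict(self, feats):
        total = sum(self.label_counts.values())
        best_label, best_score = None, -math.inf
        for label, count in self.label_counts.items():
            # log prior + sum of smoothed log likelihoods
            score = math.log(count / total)
            denom = (sum(self.feature_counts[label].values())
                     + self.alpha * len(self.vocab))
            for f in feats:
                if f in self.vocab:  # features never seen in training are skipped
                    num = self.feature_counts[label][f] + self.alpha
                    score += math.log(num / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


def precision_recall_f1(gold, pred, label):
    """Per-label precision, recall, and F1 over aligned tag sequences."""
    pairs = list(zip(gold, pred))
    tp = sum(1 for g, p in pairs if g == label and p == label)
    fp = sum(1 for g, p in pairs if g != label and p == label)
    fn = sum(1 for g, p in pairs if g == label and p != label)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1


# Hypothetical toy tweets and tags, invented for illustration only.
train = [
    (["Jokowi", "tiba", "di", "Bandung"], ["PER", "O", "O", "LOC"]),
    (["kuliah", "di", "Telkom", "University"], ["O", "O", "ORG", "ORG"]),
    (["Budi", "pergi", "ke", "Jakarta"], ["PER", "O", "O", "LOC"]),
]

X, y = [], []
for tokens, tags in train:
    for i, tag in enumerate(tags):
        X.append(token_features(tokens, i))
        y.append(tag)

model = MultinomialNB().fit(X, y)

test_tokens = ["Budi", "tiba", "di", "Jakarta"]
predicted = [model.predict(token_features(test_tokens, i))
             for i in range(len(test_tokens))]
print(list(zip(test_tokens, predicted)))
```

On real data, the features would be built from an annotated tweet corpus, and the three metrics would be computed per label on a held-out set rather than on a handful of invented sentences.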

Copyrights © 2019






Journal Info

Abbrev

indojc

Subject

Computer Science & IT

Description

Indonesian Journal on Computing (Indo-JC) is an open access scientific journal intended to bring together researchers and practitioners dealing with the general field of computing. Indo-JC is published by School of Computing, Telkom University ...