JIKSI (Jurnal Ilmu Komputer dan Sistem Informasi)
Vol 2, No 1 (2014): Jurnal Ilmu Komputer dan Sistem Informasi

EKSTRAKSI PAPER CITATION UNTUK PENDETEKSIAN SITASI PADA TULISAN KARYA ILMIAH BAHASA INDONESIA

Kevin Kevin (Unknown)
Viny Christanti (Unknown)
Prof. Dr. Ir. Dali S. Naga, MMSI (Unknown)



Article Info

Publish Date
31 Jan 2014

Abstract

The main focus of this study is to develop system to extract Indonesian paper citations with a good level of accuracy. The system based on ParsCit with feature adjustment and document training. In addition of ParsCit initial features, we add new features to match Indonesian environment along with new training data consist of Indonesian labeled headers and citations. We applied a probabilistic method Conditional Random Field (CRF) for labeling token in scientific paper reference string. CRF learns new characteristics of each entity using the new Indonesian data and build a model based on it. This model can be applied to unseen data and tested on Indonesian scientific papers. Test results shows that CRF can be applied well for Indonesian papers. System accurately labeled Indonesian paper citation with average accuracy of 98% for headers and 94% for citations. Key wordsConditional Random Field, Fakultas Teknologi Informasi Universitas Tarumanagara, Parsing Citation, Karya Ilmiah Bahasa Indonesia, Pengenalan Entitas, ParsCit

Copyrights © 2014






Journal Info

Abbrev

jiksi

Publisher

Subject

Computer Science & IT Mathematics Other

Description

Jurnal Ilmu Komputer dan Sistem Informasi (JIKSI) diterbitkan oleh Fakultas Teknologi Informasi Universitas Tarumanagara (FTI Untar) Jakarta sebagai media publikasi karya ilmiah mahasiswa program studi Teknik Informatika dan Sistem Informasi FTI Untar. Karya-karya ilmiah yang dihasilkan berupa hasil ...