Claim Missing Document
Check
Articles

Found 2 Documents
Search

PERBANDINGAN KINERJA METODE PRA-PEMROSESAN DALAM PENGKLASIFIKASIAN OTOMATIS DOKUMEN PATEN Budi Nugroho; Asep Denih
KOMPUTASI Vol 17, No 2 (2020): Komputasi: Jurnal Ilmiah Ilmu Komputer dan Matematika
Publisher : Ilmu Komputer, FMIPA, Universitas Pakuan

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (280.07 KB) | DOI: 10.33751/komputasi.v17i2.2148

Abstract

This paper presents a performance analysis and comparison of several pre-processing methods used in automatic patent classification with graph kernels for Support Vector Machine (SVM). The pre-processing methods are based on the data transform techniques, namely data scaling, data centering, data standardization, data normalization, the Box-Cox transform and the Yeo-Johnson transform. The automatic patent classification is designed to classify an input of patent citation graphs into one of 10 possible classes of the International Patent Classification (IPC). The input is taken with various background conditions. The experiments showed that the best result is achieved when the pre-processing method is data normalization, achieving a classification accuracy of up to 85.33.15% for the KEHL and 93.80% for the KVHL. In contrast, for the KEHG, the preprocessing method application decreased the accuracy.
PERBANDINGAN KINERJA METODE PRA-PEMROSESAN DALAM PENGKLASIFIKASIAN OTOMATIS DOKUMEN PATEN Budi Nugroho; Asep Denih
Komputasi: Jurnal Ilmiah Ilmu Komputer dan Matematika Vol 17, No 2 (2020): Komputasi: Jurnal Ilmiah Ilmu Komputer dan Matematika
Publisher : Ilmu Komputer, FMIPA, Universitas Pakuan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.33751/komputasi.v17i2.2148

Abstract

This paper presents a performance analysis and comparison of several pre-processing methods used in automatic patent classification with graph kernels for Support Vector Machine (SVM). The pre-processing methods are based on the data transform techniques, namely data scaling, data centering, data standardization, data normalization, the Box-Cox transform and the Yeo-Johnson transform. The automatic patent classification is designed to classify an input of patent citation graphs into one of 10 possible classes of the International Patent Classification (IPC). The input is taken with various background conditions. The experiments showed that the best result is achieved when the pre-processing method is data normalization, achieving a classification accuracy of up to 85.33.15% for the KEHL and 93.80% for the KVHL. In contrast, for the KEHG, the preprocessing method application decreased the accuracy.