Muhamad Arief Hidayat
Universitas Jember

Published : 3 Documents Claim Missing Document
Claim Missing Document
Check
Articles

Found 1 Documents
Search
Journal : SPIRIT

KLASIFIKASI BERBASIS GRAVITASI DATA DAN PROBABILITAS POSTERIOR Muhamad Arief Hidayat; Arif Djunaidy
SPIRIT Vol 7, No 1 (2015): SPIRIT
Publisher : STMIK YADIKA BANGIL

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (755.016 KB) | DOI: 10.53567/spirit.v7i1.23

Abstract

The classification method based on data gravitation (DGC) is one of the new classification techniques that uses data  gravitation as the criteria of the classification. In the case of DGC, an object is classified on the basis of the class that creates the largest gravitation in that object. However, the DGC method may cause inaccurate result when the training data being used suffer from the class imbalanced problem. This may be caused by the existence of the training data containing a class having excessively big mass that will in turn tend to classify an uknown object as a member of that class due to the high degree of the data gravitation produced, and vice versa. In this research, a modification to the DGC method is performed by constructing a classificaion method that is based on both the data gravitation and posterior probability (DGCPP). In DGCPP, the mass concept defined in the DGC method as the prior probability is replaced by the posterior probability. By using this modification, data gravitation calculation process is expected to produce more accurate results in compared to those produced by the DGC method. In addtion, by improving the data gravitation calculation, it is expected that the DGCPP method willproduce more accurate classification results in compared to those produced by the DGC method for both normal dataset as well as dataset having class imbalanced problems. A thorough tests for evaluating the classification accuracy are performed using a ten-fold cross-validation method on several datasets containing both normal andimbalanced-class datasets. The results showed that DGCPP method produced positive average of accuracy differences in compared to those produced by the DGC method. For the tests using the entire normal datasets showed that the average of accuracy differences are statistically significant with a 95% confidence level. In addition, results of the tests using the four imbalanced-class datasets also showed that the average accuracy differences are statistically significant with a 95% confidence level. Finally, results of the tests for evaluating the computing times required by the classification program showed that the additional computing time needed by DGCPP method to perform the classification process is insignificant and less than the human response time, in compared to that needed by DGC method for running all datasets being used.  Keywords—data gravitation-based classification, class imbalanced problem,posterior probability