Garuda - Garba Rujukan Digital

IJCCS (Indonesian Journal of Computing and Cybernetics Systems)

Vol 14, No 4 (2020): October

Ade Nurhopipah (Department of Informatics, Universitas Amikom Purwokerto)
Uswatun Hasanah (Departement of Information Technology, Universitas Amikom Purwokerto)

Publish Date
31 Oct 2020

The performance of classification models in machine learning algorithms is influenced by many factors, one of which is dataset splitting method. To avoid overfitting, it is important to apply a suitable dataset splitting strategy. This study presents comparison of four dataset splitting techniques, namely Random Sub-sampling Validation (RSV), k-Fold Cross Validation (k-FCV), Bootstrap Validation (BV) and Moralis Lima Martin Validation (MLMV). This comparison is done in face classification on CCTV images using Convolutional Neural Network (CNN) algorithm and Support Vector Machine (SVM) algorithm. This study is also applied in two image datasets. The results of the comparison are reviewed by using model accuracy in training set, validation set and test set, also bias and variance of the model. The experiment shows that k-FCV technique has more stable performance and provide high accuracy on training set as well as good generalizations on validation set and test set. Meanwhile, data splitting using MLMV technique has lower performance than the other three techniques since it yields lower accuracy. This technique also shows higher bias and variance values and it builds overfitting models, especially when it is applied on validation set.

Citation Download

EndNote, Reference Manager, ProCite

Latex, Jabref

Check in Google Scholar

Journal Info

IJCCS (Indonesian Journal of Computing and Cybernetics Systems)

Website

Abbrev

ijccs

Publisher

Universitas Gadjah Mada

Subject

Computer Science & IT Control & Systems Engineering

Description

Indonesian Journal of Computing and Cybernetics Systems (IJCCS), a two times annually provides a forum for the full range of scholarly study . IJCCS focuses on advanced computational intelligence, including the synergetic integration of neural networks, fuzzy logic and eveolutionary computation, so ...

Article Info

Abstract

Dataset Splitting Techniques Comparison For Face Classification on CCTV Images

Article Info

Abstract