Journal of Data Science and Its Applications
Vol 3 No 1 (2020): Journal of Data Science and Its Applications

Implementation of Minimum Redundancy Maximum Relevance (MRMR) and Genetic Algorithm (GA) for Microarray Data Classification with C4.5 Decision Tree

Irne Mabarti (Student)



Article Info

Publish Date
11 May 2020

Abstract

Cancer is one of the highest causes of death in various countries, even an increase in mortality rates happens every year. On the other hand, bioinformatics technology will be beneficial for predicting cancer, one of the methods that can be considered in predicting cancer is the classification of microarrays data. Microarray data is data containing many gene expressions that describe DNA cells. Microarray data has enormous dimensions. The dimension reduction method used in this study is the Minimum Redundancy Maximum Relevance (MRMR), the optimization method used is the Genetic Algorithm (GA) method, and the last method is C4.5 aimed at classifying gene data. In this study, there were two trials. The first trial used the Minimum Redundancy Maximum Relevance (MRMR) method combined with Genetic Algorithm (GA) as an optimization method and the C4.5 classification method, and the trial resulted in an average accuracy of 79%. While the second trial using the Genetic Algorithm (GA) method for feature selection and the C4.5 classification method produces an average accuracy of 78%.

Copyrights © 2020






Journal Info

Abbrev

jdsa

Publisher

Subject

Computer Science & IT Decision Sciences, Operations Research & Management

Description

JDSA welcomes all topics that are relevant to data science, computational linguistics, and information sciences. The listed topics of interest are as follows: Big Data Analytics Computational Linguistics Data Clustering and Classifications Data Mining and Data Analytics Data Visualization ...