Claim Missing Document
Check
Articles

Found 8 Documents
Search

Pancreatic cancer classification using logistic regression and random forest Zuherman Rustam; Fildzah Zhafarina; Glori Stephani Saragih; Sri Hartini
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 10, No 2: June 2021
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijai.v10.i2.pp476-481

Abstract

In the medical field, technology machinery is needed to solve several classification problems. Therefore, this research is useful to solve the problem of the medical field by using machine learning. This study discusses the classification of pancreatic cancer by using regression logistics and random forest. By comparing the accuracy, precision, recall (sensitivity), and F1-score of both methods, then we will know which method is better in classifying the pancreatic cancer dataset that we get from Al-Islam Hospital, Bandung, Indonesia. The results showed that random forest has better accuracy than logistic regressions. It can be seen with maximum accuracy of logistic regressions 96.48 with 30% data training and random forest 99.38% with 20% of data training.
Comparison some of kernel functions with support vector machines classifier for thalassemia dataset Ilsya Wirasati; Zuherman Rustam; Jane Eva Aurelia; Sri Hartini; Glori Stephani Saragih
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 10, No 2: June 2021
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijai.v10.i2.pp430-437

Abstract

In the medical field, accurate classification of medical data is really important because of its impact on disease detection and patient’s treatment. Technology, machine learning, is needed to help medical staff to improve accuracy to classify disease. This research discussed some kernel functions, such as gaussian radial basis function (RBF) kernel, Polynomial kernel, and linear kernel with support vector machine (SVM) to classify thalassemia data. Thalassemia is a genetic blood disorder which is also one of the major public health problems. In this paper, there is an explanation about thalassemia, SVM, and some of the kernel functions that serve as a comprehensive source for the next research about this topic. Furthermore, there is a comparison result from three kernel functions to find out which one has the best performance. The result is gaussian RBF kernel with SVM is the best method with an average of accuracy 99,63%.
Lung cancer classification using fuzzy c-means and fuzzy kernel C-Means based on CT scan image Zuherman Rustam; Aldi Purwanto; Sri Hartini; Glori Stephani Saragih
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 10, No 2: June 2021
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijai.v10.i2.pp291-297

Abstract

Cancer is one of the diseases with the highest mortality rate in the world. Cancer is a disease when abnormal cells grow out of control that can attack the body's organs side by side or spread to other organs. Lung cancer is a condition when malignant cells form in the lungs. To diagnose lung cancer can be done by taking x-ray images, CT scans, and lung tissue biopsy. In this modern era, technology is expected to help research in the field of health. Therefore, in this study feature extraction from CT images was used as data to classify lung cancer. We used CT scan image data from SPIE-AAPM Lung CT challenge 2015. Fuzzy C-Means and fuzzy kernel C-Means were used to classify the lung nodule from the patient into benign or malignant. Fuzzy C-Means is a soft clustering method that uses Euclidean distance to calculate the cluster center and membership matrix. Whereas fuzzy kernel C-Means uses kernel distance to calculate it. In addition, the support vector machine was used in another study to obtain 72% average AUC. Simulations were performed using different k-folds. The score showed fuzzy kernel C-Means had the highest accuracy of 74%, while fuzzy C-Means obtained 73% accuracy.
Acute sinusitis data classification using grey wolf optimization-based support vector machine Ajeng Maharani Putri; Zuherman Rustam; Jacub Pandelaki; Ilsya Wirasati; Sri Hartini
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 10, No 2: June 2021
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijai.v10.i2.pp438-445

Abstract

Acute sinusitis is the most common form of sinusitis, and it causes swelling and inflammation within the nose. The main thing that can causes sinusitis is probably due to viruses, and also can be caused by other factors, namely bacteria, fungi, irritation, dust, and allergens. In this research, the CT scan data attributes will be used for classification and grey wolf optimization-support vector machine (GWO-SVM) will be the machine learning technique used, where the GWO technique will be used to tuned the parameters in SVM. The performance of methods was analyzed using the python programming language with different percentages of training data, which started from 10% to 90%. The GWO-SVM method proposed provides better accuracy than using SVM without GWO.
Hepatitis classification using support vector machines and random forest Jane Eva Aurelia; Zuherman Rustam; Ilsya Wirasati; Sri Hartini; Glori Stephani Saragih
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 10, No 2: June 2021
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijai.v10.i2.pp446-451

Abstract

Hepatitis is a medical condition defined by inflammation of the liver. It can be caused by infection of the liver by hepatitis viruses or is of unknown aetiology. There are 5 main hepatitis viruses, such as virus types A, B, C, D and E. The infection may occur with limited or no symptoms, but also may include some symptoms like abdominal pain, dark urine, extreme fatigue, jaundice, nausea or vomiting. Because Indonesia is a large archipelago, the prevalence of viral infections varies greatly by region of acute hepatitis patients. This research uses data of hepatitis examination result with amount of 113 data and 5 features. The method that used is support vector machines (SVM) and random forest method. SVM is the classification method that uses discriminant hyper-plane, dividing to classes. meanwhile, random forest is a tree-based ensemble depending on a collection of random variables. SVM and random forest (RF) are applied to predict hepatitis data, and then the results will be compared.
Linear discriminant analysis and support vector machines for classifying breast cancer Zuherman Rustam; Yasirly Amalia; Sri Hartini; Glori Stephani Saragih
IAES International Journal of Artificial Intelligence (IJ-AI) Vol 10, No 1: March 2021
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.11591/ijai.v10.i1.pp253-256

Abstract

Breast cancer is an abnormal cell growth in the breast that keeps changed uncontrolled and it forms a tumor. The tumor can be benign or malignant. Benign could not be dangerous to health and cancerous, but malignant could be has a probability dangerous to health and be cancerous. A specialist doctor will diagnose the patient and give treatment based on the diagnosis which is benign or malignant. Machine learning offer times efficiency to determine a cancer cell. The machine will learn the pattern based on the information from the dataset. Support vector machines and linear discriminant analysis are common methods that can be used in the classification of cancer. In this study, both of linear discriminant analysis and support vector machines are compared by looking from accuracy, sensitivity, specificity, and F1-score. We will know which methods are better in classifying breast cancer dataset. The result shows that the support vector machine has better performance than the linear discriminant analysis. It can be seen from the accuracy is 98.77%.
The comparison study of kernel KC-means and support vector machines for classifying schizophrenia Sri Hartini; Zuherman Rustam
TELKOMNIKA (Telecommunication Computing Electronics and Control) Vol 18, No 3: June 2020
Publisher : Universitas Ahmad Dahlan

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.12928/telkomnika.v18i3.14847

Abstract

Schizophrenia is one of mental disorder that affects the mind, feeling, and behavior. Its treatment is usually permanent and quite complicated; therefore, early detection is important. Kernel KC-means and support vector machines are the methods known as a good classifier. This research, therefore, aims to compare kernel KC-means and support vector machines, using data obtained from Northwestern University, which consists of 171 schizophrenia and 221 non-schizophrenia samples. The performance accuracy, F1-score, and running time were examined using the 10-fold cross-validation method. From the experiments, kernel KC-means with the sixth-order polynomial kernel gives 87.18 percent accuracy and 93.15 percent F1-score at the faster running time than support vector machines. However, with the same kernel, it was further deduced from the results that support vector machines provides better performance with an accuracy of 88.78 percent and F1-score of 94.05 percent.
New feature selection based on kernel Zuherman Rustam; Sri Hartini
Bulletin of Electrical Engineering and Informatics Vol 9, No 4: August 2020
Publisher : Institute of Advanced Engineering and Science

Show Abstract | Download Original | Original Source | Check in Google Scholar | Full PDF (488.899 KB) | DOI: 10.11591/eei.v9i4.1959

Abstract

Feature selection is an essential issue in machine learning. It discards the unnecessary or redundant features in the dataset. This paper introduced the new feature selection based on kernel function using 16 the real-world datasets from UCI data repository, and k-means clustering was utilized as the classifier using radial basis function (RBF) and polynomial kernel function. After sorting the features using the new feature selection, 75 percent of it was examined and evaluated using 10-fold cross-validation, then the accuracy, F1-Score, and running time were compared. From the experiments, it was concluded that the performance of the new feature selection based on RBF kernel function varied according to the value of the kernel parameter, opposite with the polynomial kernel function. Moreover, the new feature selection based on RBF has a faster running time compared to the polynomial kernel function. Besides, the proposed method has higher accuracy and F1-Score until 40 percent difference in several datasets compared to the commonly used feature selection techniques such as Fisher score, Chi-Square test, and Laplacian score. Therefore, this method can be considered to use for feature selection