cover
Contact Name
Teuku Rizky Noviandy
Contact Email
trizkynoviandy@gmail.com
Phone
+6282275731976
Journal Mail Official
editorial-office@heca-analitika.com
Editorial Address
Jl. Makam T. Nyak Arief Kompleks BUPERTA Blok L7B, Lamgapang, Aceh Besar, Provinsi Aceh
Location
Kab. aceh besar,
Aceh
INDONESIA
Infolitika Journal of Data Science
ISSN : -     EISSN : 30258618     DOI : https://doi.org/10.60084/ijds
Infolitika Journal of Data Science is a distinguished international scientific journal that showcases high caliber original research articles and comprehensive review papers in the field of data science. The journals core mission is to stimulate interdisciplinary research collaboration, facilitate the exchange of knowledge, and drive the advancement and application of innovative strategies within the data science domain. Topics of this journal includes, but not limited to Data Mining and Analysis, Machine Learning and Artificial Intelligence, Big Data and Data Engineering, Predictive Modeling and Forecasting, Natural Language Processing, Computer Vision, Data Visualization and Interpretation, Ethics and Privacy in Data Science, Applications of Data Science, Interdisciplinary Approaches
Articles 5 Documents
Search results for , issue "Vol. 1 No. 1 (2023): September 2023" : 5 Documents clear
Machine Learning Approach for Diabetes Detection Using Fine-Tuned XGBoost Algorithm Aga Maulana; Farassa Rani Faisal; Teuku Rizky Noviandy; Tatsa Rizkia; Ghazi Mauer Idroes; Trina Ekawati Tallei; Mohamed El-Shazly; Rinaldi Idroes
Infolitika Journal of Data Science Vol. 1 No. 1 (2023): September 2023
Publisher : Heca Sentra Analitika

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.60084/ijds.v1i1.72

Abstract

Diabetes is a chronic condition characterized by elevated blood glucose levels which leads to organ dysfunction and an increased risk of premature death. The global prevalence of diabetes has been rising, necessitating an accurate and timely diagnosis to achieve the most effective management. Recent advancements in the field of machine learning have opened new possibilities for improving diabetes detection and management. In this study, we propose a fine-tuned XGBoost model for diabetes detection. We use the Pima Indian Diabetes dataset and employ a random search for hyperparameter tuning. The fine-tuned XGBoost model is compared with six other popular machine learning models and achieves the highest performance in accuracy, precision, sensitivity, and F1-score. This study demonstrates the potential of the fine-tuned XGBoost model as a robust and efficient tool for diabetes detection. The insights of this study advance medical diagnostics for efficient and personalized management of diabetes.
ANFIS-Based QSRR Modelling for Kovats Retention Index Prediction in Gas Chromatography Rinaldi Idroes; Teuku Rizky Noviandy; Aga Maulana; Rivansyah Suhendra; Novi Reandy Sasmita; Muslem Muslem; Ghazi Mauer Idroes; Raudhatul Jannah; Razief Perucha Fauzie Afidh; Irvanizam Irvanizam
Infolitika Journal of Data Science Vol. 1 No. 1 (2023): September 2023
Publisher : Heca Sentra Analitika

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.60084/ijds.v1i1.73

Abstract

This study aims to evaluate the implementation and effectiveness of the Adaptive Neuro-Fuzzy Inference System (ANFIS) based Quantitative Structure Retention Relationship (QSRR) to predict the Kovats retention index of compounds in gas chromatography. The model was trained using 340 essential oil compounds and their molecular descriptors. The evaluation of the ANFIS models revealed promising results, achieving an R2 of 0.974, an RMSE of 48.12, and an MAPE of 3.3% on the testing set. These findings highlight the ANFIS approach as remarkably accurate in its predictive capacity for determining the Kovats retention index in the context of gas chromatography. This study provides valuable perspectives on the efficiency of retention index prediction through ANFIS-based QSRR methods and the potential practicality in compound analysis and chromatographic optimization.
An Implementation of Hybrid CNN-XGBoost Method for Leukemia Detection Problem Taufiq Hidayat; Edrian Hadinata; Irfan Sudahri Damanik; Zakial Vikki; Irvanizam Irvanizam
Infolitika Journal of Data Science Vol. 1 No. 1 (2023): September 2023
Publisher : Heca Sentra Analitika

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.60084/ijds.v1i1.87

Abstract

Leukemia is a blood cancer in which blood cells become malignant and uncontrolled. It can cause damage to the function of the body's organs. Several machine learning methods have been used to automatically detect biomedical images, including blood cell images. In this study, we utilized a hybrid machine learning method, called a hybrid Convolutional Neural Network-eXtreme Gradient Boosting (CNN-XGBoost) method to detect leukemia in blood cells. The hybrid method combines two machine learning methods. We use CNN as the basic classifier and XGBoost as the main classification method. The aim of this methodology was to assess whether incorporating the basic classification method would lead to an enhancement in the performance of the main classification model. The experimental findings demonstrated that the utilization of XGBoost as the main classifier led to a marginal increase in accuracy, elevating it from 85.32% to 85.43% compared to the basic CNN classification. This research highlights the potential of hybrid machine learning approaches in biomedical image analysis and their role in advancing the early diagnosis of leukemia and potentially other medical conditions.
Maternal and Child Healthcare Services in Aceh Province, Indonesia: A Correlation and Clustering Analysis in Statistics Novi Reandy Sasmita; Siti Ramadeska; Reksi Utami; Zuhra Adha; Ulayya Putri; Risky Haezah Syarafina; La Ode Reskiaddin; Saiful Kamal; Yarmaliza Yarmaliza; Muliadi Muliadi; Arif Saputra
Infolitika Journal of Data Science Vol. 1 No. 1 (2023): September 2023
Publisher : Heca Sentra Analitika

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.60084/ijds.v1i1.88

Abstract

Infant mortality remains a public health problem in Aceh Province, Indonesia. Health services during pregnancy are an essential factor in reducing infant mortality. Studies examining factors such as maternal and child health services that have implications for infant mortality in Aceh province are still scarce. Therefore, this study aims to examine the correlation between maternal and child health services variables such as Blood-Supplementing Tablets (TTD), Coverage of the First Visit of Pregnant Women (K1), Coverage of the First Visit of Pregnant Women (K4), and management of Obstetric Complications to live births and to map the maternal and child health services obtained during pregnancy. A cross-sectional study was used as the research study. This study used descriptive statistics, such as measures of data centering and data dispersion. In this work, inferential statistical analysis was conducted using the Shapiro-Wilk test, Spearman test, and fuzzy c-means. The result of the Shapiro Wilk test stated that the live birth rate variable and all Maternal and Child Healthcare Services variables were not normally distributed (p-value < 0.05), all Maternal and Child Healthcare Services variables were positively correlated to live birth rate based on the Spearman test (p-value < 0.05). Based on the Silhouette Index with 0.555, the formation of 3 clusters is the optimal cluster. The clustering is based on the Maternal and Child Healthcare Services that have been provided, where the first, second, and third clusters consist of five districts/city, eight districts/city, and ten districts/city, respectively, as a result of Fuzzy C-Means Clustering.
Ensemble Machine Learning Approach for Quantitative Structure Activity Relationship Based Drug Discovery: A Review Teuku Rizky Noviandy; Aga Maulana; Ghazi Mauer Idroes; Talha Bin Emran; Trina Ekawati Tallei; Zuchra Helwani; Rinaldi Idroes
Infolitika Journal of Data Science Vol. 1 No. 1 (2023): September 2023
Publisher : Heca Sentra Analitika

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.60084/ijds.v1i1.91

Abstract

This comprehensive review explores the pivotal role of ensemble machine learning techniques in Quantitative Structure-Activity Relationship (QSAR) modeling for drug discovery. It emphasizes the significance of accurate QSAR models in streamlining candidate compound selection and highlights how ensemble methods, including AdaBoost, Gradient Boosting, Random Forest, Extra Trees, XGBoost, LightGBM, and CatBoost, effectively address challenges such as overfitting and noisy data. The review presents recent applications of ensemble learning in both classification and regression tasks within QSAR, showcasing the exceptional predictive accuracy of these techniques across diverse datasets and target properties. It also discusses the key challenges and considerations in ensemble QSAR modeling, including data quality, model selection, computational resources, and overfitting. The review outlines future directions in ensemble QSAR modeling, including the integration of multi-modal data, explainability, handling imbalanced data, automation, and personalized medicine applications while emphasizing the need for ethical and regulatory guidelines in this evolving field.

Page 1 of 1 | Total Record : 5