cover
Contact Name
Aji Prasetya Wibawa
Contact Email
aji.prasetya.ft@um.ac.id
Phone
+62818539333
Journal Mail Official
keds.journal@um.ac.id
Editorial Address
Gedung G4. Lantai 1 Jl. Semarang No.5, Malang
Location
Kota malang,
Jawa timur
INDONESIA
Knowledge Engineering and Data Science
ISSN : -     EISSN : 25974637     DOI : http://dx.doi.org/10.17977
Knowledge Engineering and Data Science (2597-4637), KEDS, brings together researchers, industry practitioners, and potential users, to promote collaborations, exchange ideas and practices, discuss new opportunities, and investigate analytics frameworks on data-driven and knowledge base systems.
Articles 7 Documents
Search results for , issue "Vol 6, No 2 (2023)" : 7 Documents clear
The Effect of the Number of Hidden Layers on The Performance of Deep Q-Network for Traveling Salesman Problem Benzfica Hanif; Aisyah Larasati; Rudi Nurdiansyah; Trung Le
Knowledge Engineering and Data Science Vol 6, No 2 (2023)
Publisher : Universitas Negeri Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.17977/um018v6i22023p188-198

Abstract

The Traveling Salesman Problem (TSP) effectively represents the complex distribution issues encountered by couriers, who must carefully plan a route that includes all customer addresses while minimizing the distance traveled. As the magnitude of deliveries and the range of destinations expand, the courier's responsibility becomes progressively challenging. In this particular context, the objective of our research is to expand the existing knowledge and explore the complete capabilities of Deep Q-Network (DQN) models in order to achieve the most efficient route determination. This endeavor can potentially bring about significant changes in the courier and delivery service sector. The foundation of our unique methodology relies on an empirical inquiry, utilizing a comprehensive dataset including 178 observations obtained from motorcycle-based package delivery agents. Our research is carefully planned and executed using a comprehensive factorial experimental design. This design incorporates three crucial factors: the number of hidden layers, episodes, and epochs. The hidden layer parameter is set to a singular level, while the episode parameter is configured to explore five levels, and the epoch parameter is designed to travel four levels. The evaluation of our DQN models' performance is conducted utilizing the MSE metric as a measure. This assessment is carried out at every iterative cycle, ensuring thorough scrutiny. The central focus of our research centers on the intricate connection between episodes and epochs, and their influence on MSE. The findings of our study reveal that the association between episodes, epochs, and errors is not statistically significant although different level of episodes and epochs produces slightly different level of error.
Evidence of Students’ Academic Performance at the Federal College of Education Asaba Nigeria: Mining Education Data Arnold Adimabua Ojugoa; Christopher Chukwufunaya Odiakaose; Frances Emordi; Rita Erhovwo Ako; Winifred Adigwe; Kizito Eluemonor Anazia; Victor Geteloma
Knowledge Engineering and Data Science Vol 6, No 2 (2023)
Publisher : Universitas Negeri Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.17977/um018v6i22023p145-156

Abstract

One main objective of higher education is to provide quality education to its students. One way to achieve the highest level of quality in the higher education system is by discovering knowledge for prediction regarding enrolment of students in a particular course, alienation of traditional classroom teaching model, detection of unfair means used in online examination, detection of abnormal values in the result sheets of the students, and prediction about students’ performance. The knowledge is hidden among the educational data set and is extractable through data mining techniques. The present paper is designed to justify the capabilities of data mining techniques in the context of higher education by offering a data mining model for the higher education system in the university. In this research, the classification task is used to evaluate student’s performance, and as many approaches are used for data classification, the decision tree method is used here. By this, we extract data that describes students’ summative performance at semester’s end, helps to identify the dropouts and students who need special attention, and allows the teacher to provide appropriate advising/counseling.
Multivariate Analysis Approach to Factor-Affected Tuberculosis Disease Zuli Agustina Gultom; Farid Akbar Siregar; Mahardika Abdi Prawira Tanjung; Al-Hamidy Hazidar
Knowledge Engineering and Data Science Vol 6, No 2 (2023)
Publisher : Universitas Negeri Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.17977/um018v6i22023p114-128

Abstract

Tuberculosis is a disease caused by infection with the mycobacterium tuberculosis complex. Tuberculosis attack organ besides the lung, such as the pleura, lining of the brain, lining of the heart, lymph gland, bones, joint, skin, intestines, kidney, urinary tract, and genital. This disease is found in densely populated settlements with poor sanitation, lack of ventilation and sunlight and lack of rest. Moreover, the factors that will be analyzed in this research are Population Density (X1), Number of HIV/AIDS (X2), number of toddlers who experience nutrition (X3), Number of toddlers who experience BCG immunization (X4), number of toddlers who get exclusive breastfeeding (X5), Total families with PHBS (X6), number of residents with healthy homes (X7), number of families with clean water facilities (X8), number of families with ownership of latrine sanitation (X9), number of families with have landfills (X10), number of families have management waste place (X11), number of elementary education facilities (X12), Number of junior school education facilities (X13), Number of senior school education facilities (X14), Number of institutions fostered by neighborhood health (X15), Number of Posyandu (X16), Number Life Expectancy (X17), Literacy Rate (X18), Human Development Index (X19), Number of Tuberculosis sufferers (X20). This research aims to analyze what variables influence each other on the prevalence rate of tuberculosis in the city of Surabaya. The method used in this research is a multivariate analysis using factor analysis, cluster analysis, biplot analysis and discriminant analysis. This discriminant analysis determines accuracy by calculating the value (1-APER). The resulting research the Number of HIV/AIDS, number of residents with healthy homes, and Number of families with ownership of Sanitation (latrine, landfills, waste management) have a high correlation with the spread of tuberculosis in Surabaya. Meanwhile, areas with a high rate of tuberculosis are Tambaksari, Wonokromo, Sawahan, and Semampir.  The classification analysis accuracy level was 90.32% and the accuracy of the resulting model or discriminant function was very high. So that discriminant analysis can be used for predicting the accuracy of tuberculosis prevalence rates.
Systematic Literature Review on Ontology-based Indonesian Question Answering System Fadhila Tangguh Admojo; Adidah Lajis; Haidawati Nasir
Knowledge Engineering and Data Science Vol 6, No 2 (2023)
Publisher : Universitas Negeri Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.17977/um018v6i22023p129-144

Abstract

Question-Answering (QA) systems at the intersection of natural language processing, information retrieval, and knowledge representation aim to provide efficient responses to natural language queries. These systems have seen extensive development in English and languages like Indonesian present unique challenges and opportunities. This literature review paper delves into the state of ontology-based Indonesian QA systems, highlighting critical challenges. The first challenge lies in sentence understanding, variations, and complexity. Most systems rely on syntactic analysis and struggle to grasp sentence semantics. Complex sentences, especially in Indonesian, pose difficulties in parsing, semantic interpretation, and knowledge extraction. Addressing these linguistic intricacies is pivotal for accurate responses. Secondly, template-based SPARQL query construction, commonly used in Indonesian QA systems, suffers from semantic gaps and inflexibility. Advanced techniques like semantic matching algorithms and dynamic template generation can bridge these gaps and adapt to evolving ontologies. Thirdly, lexical gaps and ambiguity hinder QA systems. Bridging vocabulary mismatches between user queries and ontology labels remains a challenge. Strategies like synonym expansion, word embedding, and ontology enrichment must be explored further to overcome these challenges. Lastly, the review discusses the potential of developing multi-domain ontologies to broaden the knowledge coverage of QA systems. While this presents complex linguistic and ontological challenges, it offers the advantage of responding to various user queries across various domains. This literature review identifies crucial challenges in developing ontology-based Indonesian QA systems and suggests innovative approaches to address these challenges.
Recurrent Session Approach to Generative Association Rule based Recommendation Tubagus Arief Armanda; Ire Puspa Wardhani; Tubagus M. Akhriza; Tubagus M. Adrie Admira
Knowledge Engineering and Data Science Vol 6, No 2 (2023)
Publisher : Universitas Negeri Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.17977/um018v6i22023p199-214

Abstract

This article introduces a generative association rule (AR)-based recommendation system (RS) using a recurrent neural network approach implemented when a user searches for an item in a browsing session. It is proposed to overcome the limitations of the traditional AR-based RS which implements query-based sessions that are not adaptive to input series, thus failing to generate recommendations.  The dataset used is accurate retail transaction data from online stores in Europe. The contribution of the proposed method is a next-item prediction model using LSTM, but what is trained to develop the model is an associative rule string, not a string of items in a purchase transaction. The proposed model predicts the next item generatively, while the traditional method discriminatively. As a result, for an array of items that the user has viewed in a browsing session, the model can always recommend the following items when traditional methods cannot.  In addition, the results of user-centered validation of several metrics show that although the level of accuracy (similarity) of recommended products and products seen by users is only 20%, other metrics reach above 70%, such as novelty, diversity, attractiveness and enjoyability.
Deep Learning Approaches with Optimum Alpha for Energy Usage Forecasting Aji Prasetya Wibawa; Agung Bella Putra Utama; Ade Kurnia Ganesh Akbari; Akhmad Fanny Fadhilla; Alfiansyah Putra Pertama Triono; Andien Khansa’a Iffat Paramarta; Faradini Usha Setyaputri; Leonel Hernandez
Knowledge Engineering and Data Science Vol 6, No 2 (2023)
Publisher : Universitas Negeri Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.17977/um018v6i22023p170-187

Abstract

Energy use is an essential aspect of many human activities, from individual to industrial scale. However, increasing global energy demand and the challenges posed by environmental change make understanding energy use patterns crucial. Accurate predictions of future energy consumption can greatly influence decision-making, supply-demand stability and energy efficiency. Energy use data often exhibits time-series patterns, which creates complexity in forecasting. To address this complexity, this research utilizes Deep Learning (DL), Convolutional Neural Networks (CNN), Recurrent Neural Networks (RNN), Long Short-term Memory (LSTM), Bidirectional LSTM (Bi-LSTM), and Gated Recurrent Unit (GRU) models. The main objective is to improve the accuracy of energy usage forecasting by optimizing the alpha value in exponential smoothing, thereby improving forecasting accuracy. The results showed that all DL methods experienced improved accuracy when using optimum alpha. LSTM has the most optimal MAPE, RMSE, and R2 values compared to other methods. This research promotes energy management, decision-making, and efficiency by providing an innovative framework for accurate forecasting of energy use, thus contributing to a sustainable and efficient energy system.
EEG Classification while Listening to Murottal Al-Quran and Classical Music using Random Forest Method Heni Sumarti; Fahira Septiani; Agus Sudarmanto; Wahyu Caesarendra; Rizki Edmi Edison
Knowledge Engineering and Data Science Vol 6, No 2 (2023)
Publisher : Universitas Negeri Malang

Show Abstract | Download Original | Original Source | Check in Google Scholar | DOI: 10.17977/um018v6i22023p157-169

Abstract

This study is aimed to classify the brain activity of adolescents associated with audio stimuli; murottal Al-Quran and classical music.  The raw data were filtered using Independent Component Analisys (ICA) and followed by band-pass filter in Python on the Google Colab Extraction was processed with Power Spectral Density (PSD) and the Random Forest Method in Weka Machine Learning was used for classification.  The research results showed the same results between the two types of stimulation, namely the order of brain waves from highest to lowest were delta, alpha, theta and beta. The average brain waves of teenagers when given murottal al-Quran stimulation were 45.32% delta, 31.60% alpha, 17.02 theta and 6.05% beta. Meanwhile, the average brain waves of teenagers when given classical music stimulation were 46.54% delta, 28.64% alpha, 19.21% theta and 5.50% beta. Classification is obtained with the best value that frequently appears (mode) from the prediction results for each sample using random forest methods. The accuracy, precision, and recall of classifying adolescent brain waves when given murottal and classical music stimuli using the Random Forest method with cross-validation technique (optimum at k-fold=5) were 65.38%, 76.92%, and 70.00%, respectively.  The results of this study show that stimulation using murottal al-Quran and classical music effectively improves adolescent relaxation conditions.

Page 1 of 1 | Total Record : 7