Knowledge Engineering and Data Science
Vol 5, No 2 (2022)

Can Multinomial Logistic Regression Predicts Research Group using Text Input?

Harits Ar Rosyid (Universitas Negeri Malang, Indonesia)
Aulia Yahya Harindra Putra (Universitas Negeri Malang)
Muhammad Iqbal Akbar (Universitas Negeri Malang)
Felix Andika Dwiyanto (AGH University of Science and Technology)



Article Info

Publish Date
30 Dec 2022

Abstract

While submitting proposals in SISINTA, students often confuse or falsely submit their proposals to the less relevant or incorrect research group. There are 13 research groups for the students to choose from. We proposed a text classification method to help students find the best research group based on the title and/or abstract. The stages in this study include data collection, preprocessing data, classification using Logistic Regression, and evaluation of the results. Three scenarios in research group classification are based on 1) title only, 2) abstract only, and 3) title and abstract. Based on the experiments, research group classification using title-only input is the best overall. This scenario gets the most optimal results with accuracy, precision, recall, and f1-score successively at 63.68%, 64.91%, 63.68%, and 63.46%. This result is sufficient to help students find the best research group based on the text titles. In addition, lecturers can comment more elaborately since the proposals are relevant to the research group’s scope.

Copyrights © 2022






Journal Info

Abbrev

keds

Publisher

Subject

Computer Science & IT Engineering

Description

Knowledge Engineering and Data Science (2597-4637), KEDS, brings together researchers, industry practitioners, and potential users, to promote collaborations, exchange ideas and practices, discuss new opportunities, and investigate analytics frameworks on data-driven and knowledge base systems. ...