RABIT: Jurnal Teknologi dan Sistem Informasi Univrab
Vol 6 No 2 (2021): July

IDENTIFYING HOAXES IN ONLINE NEWS SEARCH RESULTS USING WEB SCRAPING AND THE C4.5 ALGORITHM

Diki Arisandi (Informatics Engineering Study Program, Faculty of Engineering, Universitas Abdurrab)
Zul Indra (Informatics Engineering Study Program, Faculty of Engineering, Universitas Abdurrab)
Kartini Kartini (Informatics Engineering Study Program, Faculty of Engineering, Universitas Abdurrab)

Article Info

Publish Date
08 Jul 2021

Abstract

Online news is a journalistic product that reports facts or events and is produced and distributed via the internet. However, not all information circulated through online media is factual; false information of this kind is described as a hoax. The large volume of hoax news affects the people who read it and can lead to misperceptions or inappropriate actions. We use a web scraping technique to extract content from search engine results and then apply the C4.5 algorithm to classify it. Three parameters serve as the classification attributes: an invitation to spread the news, the credibility of the source, and a provocative title. The result of this work is a decision tree that can classify news content as a hoax or as legitimate. In the experiments carried out, classification using web scraping and the C4.5 algorithm achieved an accuracy of 80% in identifying hoaxes.
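As a rough illustration of how C4.5 chooses which attribute to split on, the sketch below computes the gain ratio for the three attributes named in the abstract. The attribute keys (`invites_sharing`, `credible_source`, `provocative_title`), the six toy rows, and their labels are hypothetical and invented for this example; they are not the authors' data set, and this is not their implementation.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def gain_ratio(rows, labels, feature):
    """C4.5 split criterion: information gain divided by split information."""
    total = len(rows)
    # Group the class labels by the value the feature takes in each row.
    groups = {}
    for row, label in zip(rows, labels):
        groups.setdefault(row[feature], []).append(label)
    remainder = sum(len(g) / total * entropy(g) for g in groups.values())
    split_info = -sum(
        (len(g) / total) * log2(len(g) / total) for g in groups.values()
    )
    gain = entropy(labels) - remainder
    return gain / split_info if split_info > 0 else 0.0


def best_split(rows, labels, features):
    """Return the feature with the highest gain ratio."""
    return max(features, key=lambda f: gain_ratio(rows, labels, f))


# Hypothetical attribute names modelled on the abstract's three parameters.
FEATURES = ["invites_sharing", "credible_source", "provocative_title"]

# Toy rows invented for illustration only (not the paper's data).
ROWS = [
    {"invites_sharing": True, "credible_source": False, "provocative_title": True},
    {"invites_sharing": True, "credible_source": False, "provocative_title": False},
    {"invites_sharing": False, "credible_source": False, "provocative_title": True},
    {"invites_sharing": False, "credible_source": True, "provocative_title": False},
    {"invites_sharing": False, "credible_source": True, "provocative_title": True},
    {"invites_sharing": True, "credible_source": True, "provocative_title": False},
]
LABELS = ["hoax", "hoax", "hoax", "legitimate", "legitimate", "legitimate"]

if __name__ == "__main__":
    for name in FEATURES:
        print(name, round(gain_ratio(ROWS, LABELS, name), 3))
    print("root split:", best_split(ROWS, LABELS, FEATURES))
```

In this toy data the source-credibility attribute separates the two classes perfectly, so it would become the root of the tree; a full C4.5 implementation would then recurse on each branch and prune the result.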

Copyrights © 2021

Journal Info

Abbrev

rabit

Publisher

Subject

Computer Science & IT Engineering

Description

This journal is called RABIT. The name combines two parts: RAB, which stands for Abdurrab University, and IT, which stands for information technology. It can be read as the journal of the Informatics Engineering Study Program of Abdurrab University, Pekanbaru. This RABIT ...