Vol 18, No 2: April 2020

Genomic repeats detection using Boyer-Moore algorithm on Apache Spark Streaming

Lala Septem Riza (Universitas Pendidikan Indonesia)
Farhan Dhiyaa Pratama (Universitas Pendidikan Indonesia)
Erna Piantari (Universitas Pendidikan Indonesia)
Mahmoud Fahsi (Djillali Liabes University)



Article Info

Publish Date
01 Apr 2020

Abstract

Detecting genomic repeats, i.e., searching for patterns in strings to find repeated base-pair sequences in Deoxyribonucleic Acid (DNA), requires a long processing time. This research builds a big-data computational model that searches for patterns in strings by modifying and implementing the Boyer-Moore algorithm on Apache Spark Streaming, applied to human DNA sequences from the Ensembl site. We also run experiments on cloud computing, varying the specifications of the computer clusters and using datasets of human DNA sequences. The results show that the proposed computational model on Apache Spark Streaming is faster than standalone computing and multicore parallel computing. The main contribution of this research, a computational model that reduces computational cost, has therefore been achieved.
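To make the idea in the abstract concrete, the following is a minimal Python sketch of Boyer-Moore pattern searching on a DNA string. It is not the authors' implementation: it uses only the bad-character rule (a simplified variant of Boyer-Moore), it does not use Apache Spark, and the names `bad_character_table`, `boyer_moore_search`, and `search_in_chunks` are hypothetical. The chunked search only imitates, on a single machine, how a long sequence might be split into independently searchable pieces; the overlap of `len(pattern) - 1` characters is an assumption made so that matches spanning a chunk boundary are not lost.

```python
def bad_character_table(pattern):
    """Map each character to the index of its last occurrence in the pattern."""
    return {ch: i for i, ch in enumerate(pattern)}


def boyer_moore_search(text, pattern):
    """Return the start indices of all occurrences of pattern in text.

    Simplified Boyer-Moore: only the bad-character rule is used.
    """
    m, n = len(pattern), len(text)
    if m == 0 or m > n:
        return []
    last = bad_character_table(pattern)
    matches = []
    s = 0  # current alignment of the pattern against the text
    while s <= n - m:
        j = m - 1
        # Compare the pattern with the text from right to left.
        while j >= 0 and pattern[j] == text[s + j]:
            j -= 1
        if j < 0:
            matches.append(s)
            s += 1  # conservative shift so overlapping matches are also found
        else:
            # Align the mismatched text character with its last occurrence
            # in the pattern, or move past it if it does not occur at all.
            s += max(1, j - last.get(text[s + j], -1))
    return matches


def search_in_chunks(sequence, pattern, chunk_size):
    """Search fixed-size chunks independently (hypothetical stand-in for
    distributing work across workers) and merge the global positions."""
    overlap = len(pattern) - 1  # assumption: enough to catch boundary matches
    found = set()
    for start in range(0, len(sequence), chunk_size):
        chunk = sequence[start:start + chunk_size + overlap]
        for pos in boyer_moore_search(chunk, pattern):
            found.add(start + pos)
    return sorted(found)


if __name__ == "__main__":
    dna = "ACGTACGTTACGACG"  # made-up toy sequence, not real genomic data
    print(boyer_moore_search(dna, "ACG"))
    print(search_in_chunks(dna, "ACG", chunk_size=5))
```

Both calls should report the same positions, since chunking changes only how the work is divided and not which repeats are found.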

Copyrights © 2020






Journal Info

Abbrev

TELKOMNIKA

Publisher

Subject

Computer Science & IT

Description

Submitted papers are evaluated by anonymous referees through single-blind peer review for contribution, originality, relevance, and presentation. The Editor shall inform you of the results of the review as soon as possible, hopefully within 10 weeks. Please note that because of the great number of ...