Journal of Computer Science and Information Technology
Volume 7 Issue 3 (2021): JCSITech

Alliance Rules-based Algorithm on Detecting Duplicate Entry Email

Arif Hanafi (Universiti Malaysia Pahang)
Sulaiman Harun (Asia Pacific University, Kuala Lumpur, Malaysia)
Sofika Enggari (Universitas Putra Indonesia YPTK Padang)
Larissa Navia Rani (Universitas Putra Indonesia YPTK Padang)



Article Info

Publish Date
30 Jul 2021

Abstract

Email is unquestionably central to modern business communication. Every day, large volumes of email are sent from organizations to clients and suppliers, from employees to their managers, and from one colleague to another. As a result, data warehouses hold a vast amount of email data. Data cleaning is an activity performed on the data sets of a data warehouse to improve and maintain the quality and consistency of the data. This paper highlights the issues associated with dirty data, in particular the detection of duplicates in an email column. It examines the strategy of data cleaning from a different point of view and provides an algorithm for discovering errors and duplicate entries in the data sets of an existing data warehouse. The paper defines alliance rules, based on the concept of mathematical association rules, to identify duplicate entries in the email column of data sets.
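As a rough illustration of what detecting duplicate entries in an email column can involve, the sketch below groups addresses by a normalized form and reports those that occur more than once. It is not the paper's alliance rules algorithm, whose details the abstract does not give. The function names `normalize_email` and `find_duplicate_emails`, the sample data, and the choice to treat addresses as equal after trimming whitespace and lowercasing are all assumptions made for this example.

```python
from collections import defaultdict


def normalize_email(raw: str) -> str:
    """Comparison key (an assumption of this sketch): trim whitespace, lowercase."""
    return raw.strip().lower()


def find_duplicate_emails(rows):
    """Map each normalized address seen more than once to its row indexes."""
    seen = defaultdict(list)
    for index, raw in enumerate(rows):
        if raw is None or not raw.strip():
            continue  # skip empty cells rather than treating them as duplicates
        seen[normalize_email(raw)].append(index)
    # Keep only the addresses that appear in two or more rows.
    return {email: idxs for email, idxs in seen.items() if len(idxs) > 1}


# Hypothetical email column with one duplicate that differs in case and spacing.
emails = ["Ann@Example.com", "bob@example.com", " ann@example.com ", "", "carol@example.com"]
duplicates = find_duplicate_emails(emails)
print(duplicates)  # {'ann@example.com': [0, 2]}
```

Lowercasing the whole address is a simplification. Mail systems may treat the part before the `@` as case-sensitive, so a stricter cleaning step might lowercase only the domain.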

Copyrights © 2021






Journal Info

Abbrev

jcsitech

Publisher

Subject

Computer Science & IT; Control & Systems Engineering; Electrical & Electronics Engineering; Engineering; Materials Science & Nanotechnology

Description

Journal of Computer Science and Information Technology is a threetly journal published by Universitas Putra Indonesia YPTK, Padang. It publishes scientific and technical papers describing original research work or novel product/process development. The objectives are to promote exchange of ...