Journal of Applied Data Sciences
Vol 3, No 2: MAY 2022

Big Data Classification of Personality Types Based on Respondents’ Big Five Personality Traits

Jennifer Chi (School of Behavioral and Brain Sciences, University of Texas at Dallas, Richardson, USA)



Article Info

Publish Date
24 May 2022

Abstract

This study introduces a mixed model that combines k-means clustering analysis for data examination, discriminant analysis for classification, and multilayer perceptron neural network analysis for prediction. After inadequate samples and outliers were deleted, 1,009,998 observations remained; the data were collected in 2018 through an interactive online personality test (i.e., a test of the Big Five personality traits). The k-means clustering analysis identified four personality clusters from the total scores on the Big Five personality traits (Extraversion, Neuroticism, Agreeableness, Conscientiousness, and Openness to Experience). The clustering results were tested for accuracy with discriminant analysis, which indicated that the cluster means were significantly different and that 95.8% of the original grouped cases were correctly classified. A multilayer perceptron neural network with a 5-5-4 construction was used as the predictive model for the personality classification of participants: 99.5% of the training cases and 99.5% of the testing cases were correctly classified. The results may provide insight into participants' personalities for further psychological, social, cultural, and economic considerations.
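To make the "5-5-4" construction concrete, the following is a minimal sketch of a forward pass through a network with five inputs (one per trait score), five hidden units, and four outputs (one per cluster). It is not the study's model: the tanh hidden activation, the softmax output, the random placeholder weights, and the example trait scores are all assumptions made for illustration, since the abstract reports only the layer sizes.

```python
import math
import random

# Layer sizes taken from the "5-5-4" construction reported in the abstract.
N_INPUTS, N_HIDDEN, N_OUTPUTS = 5, 5, 4


def init_layer(n_in, n_out, rng):
    """Return (weights, biases) filled with placeholder values (NOT trained)."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense(x, weights, biases):
    """Affine map: one weighted sum plus bias per output unit."""
    return [sum(w * xi for w, xi in zip(row, x)) + b for row, b in zip(weights, biases)]


def softmax(z):
    """Numerically stable softmax over a list of scores."""
    m = max(z)
    exps = [math.exp(v - m) for v in z]
    total = sum(exps)
    return [e / total for e in exps]


def forward(x, hidden, output):
    """Assumed activations: tanh in the hidden layer, softmax at the output."""
    h = [math.tanh(v) for v in dense(x, *hidden)]
    return softmax(dense(h, *output))


def count_parameters(*layers):
    """Total number of weights and biases across the given layers."""
    return sum(len(w) * len(w[0]) + len(b) for w, b in layers)


rng = random.Random(0)  # fixed seed so the sketch is reproducible
hidden = init_layer(N_INPUTS, N_HIDDEN, rng)
output = init_layer(N_HIDDEN, N_OUTPUTS, rng)

# Hypothetical standardized trait scores for one respondent, in the order
# Extraversion, Neuroticism, Agreeableness, Conscientiousness, Openness.
traits = [0.3, -1.2, 0.8, 0.1, 1.5]

probs = forward(traits, hidden, output)  # one probability per cluster
predicted_cluster = max(range(N_OUTPUTS), key=probs.__getitem__)
n_params = count_parameters(hidden, output)  # (5*5 + 5) + (5*4 + 4)

print("cluster probabilities:", [round(p, 3) for p in probs])
print("predicted cluster index:", predicted_cluster)
print("trainable parameters:", n_params)
```

With these layer sizes the network has only 54 trainable parameters, so a model of this shape is very small relative to roughly one million observations. In a real analysis the weights would be fitted to the cluster labels produced by the k-means step rather than drawn at random.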

Copyrights © 2022






Journal Info

Abbrev

JADS

Publisher

Subject

Computer Science & IT; Control & Systems Engineering; Decision Sciences, Operations Research & Management

Description

One of the current hot topics in science is data: how can datasets be used in scientific and scholarly research in a more reliable, citable and accountable way? Data is of paramount importance to scientific progress, yet most research data remains private. Enhancing the transparency of the processes ...