Personality Classification from Online Text using Machine Learning Approach

被引:0
|
作者
Khan, Alam Sher [1 ]
Ahmad, Hussain [1 ]
Asghar, Muhammad Zubair [1 ]
Saddozai, Furcian Khan [1 ]
Arir, Areeba [1 ]
Khalid, Hassan Ali [1 ]
机构
[1] Gomal Univ, Inst Comp & Informat Technol, Dera Ismail Khan, Pakistan
关键词
Personality recognition; re-sampling; machine learning; XGBoost; class imbalanced; MBTI; social networks; SOCIAL MEDIA;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Personality refer to the distinctive set of characteristics of a person that effect their habits, behaviour's, attitude and pattern of thoughts. Text available on Social Networking sites provide an opportunity to recognize individual's personality traits automatically. In this proposed work, Machine Learning Technique, XGBoost classifier is used to predict four personality traits based on Myers- Briggs Type Indicator (MBTI) model, namely Introversion-Extroversion(I-E), iNtuition-Sensing(N-S), Feeling-Thinking(F-T) and Judging-Perceiving(J-P) from input text. Publically available benchmark dataset from Kaggle is used in experiments. The skewness of the dataset is the main issue associated with the prior work, which is minimized by applying Re-sampling technique namely random over-sampling, resulting in better performance. For more exploration of the personality from text, pre-processing techniques including tokenization, word stemming, stop words elimination and feature selection using TF IDF are also exploited. This work provides the basis for developing a personality identification system which could assist organization for recruiting and selecting appropriate personnel and to improve their business by knowing the personality and preferences of their customers. The results obtained by all classifiers across all personality traits is good enough, however, the performance of XGBoost classifier is outstanding by achieving more than 99% precision and accuracy for different traits.
引用
收藏
页码:460 / 476
页数:17
相关论文
共 50 条
  • [31] Text Classification and Machine Learning Support for Requirements Analysis Using Blogs
    Lange, Douglas S.
    [J]. INNOVATIONS FOR REQUIREMENTS ANALYSIS: FROM STAKEHOLDERS' NEEDS TO FORMAL DESIGNS, 2008, 5320 : 182 - 195
  • [32] DOMAIN SPECIFIC SYNTAX BASED APPROACH FOR TEXT CLASSIFICATION IN MACHINE LEARNING CONTEXT
    Mohasseb, Alaa
    Bader-El-Den, Mohamed
    Liu, Han
    Cocea, Mihaela
    [J]. PROCEEDINGS OF 2017 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOL 2, 2017, : 652 - 657
  • [33] Detection and Classification of Psychopathic Personality Trait from Social Media Text Using Deep Learning Model
    Asghar, Junaid
    Akbar, Saima
    Asghar, Muhammad Zubair
    Ahmad, Bashir
    Al-Rakhami, Mabrook S.
    Gumaei, Abdu
    [J]. COMPUTATIONAL AND MATHEMATICAL METHODS IN MEDICINE, 2021, 2021
  • [34] Toward Extracting Information from Public Health Statutes using Text Classification and Machine Learning
    Grabmair, Matthias
    Ashley, Kevin D.
    Hwa, Rebecca
    Sweeney, Patricia M.
    [J]. Legal Knowledge and Information Systems, 2011, 235 : 73 - 82
  • [35] Automatic tortuosity classification using machine learning approach
    Turior, Rashmi
    Chutinantvarodom, Pornthep
    Uyyanonvara, Bunyarit
    [J]. INDUSTRIAL INSTRUMENTATION AND CONTROL SYSTEMS, PTS 1-4, 2013, 241-244 : 3143 - 3147
  • [36] A deep learning approach for image and text classification using neutrosophy
    Wajid M.A.
    Zafar A.
    Wajid M.S.
    [J]. International Journal of Information Technology, 2024, 16 (2) : 853 - 859
  • [37] Emotional state classification from EEG data using machine learning approach
    Wang, Xiao-Wei
    Nie, Dan
    Lu, Bao-Liang
    [J]. NEUROCOMPUTING, 2014, 129 : 94 - 106
  • [38] Classification of Diabetic Foot Ulcers from Images Using Machine Learning Approach
    Almufadi, Nouf
    Alhasson, Haifa F.
    [J]. DIAGNOSTICS, 2024, 14 (16)
  • [39] A Review of Machine Learning Algorithms for Text Classification
    Li, Ruiguang
    Liu, Ming
    Xu, Dawei
    Gao, Jiaqi
    Wu, Fudong
    Zhu, Liehuang
    [J]. CYBER SECURITY, CNCERT 2021, 2022, 1506 : 226 - 234
  • [40] Application of machine learning method in text classification
    Sui, Zhenhuan
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 120 - 120