Personality Classification from Online Text using Machine Learning Approach

被引:0
|
作者
Khan, Alam Sher [1 ]
Ahmad, Hussain [1 ]
Asghar, Muhammad Zubair [1 ]
Saddozai, Furcian Khan [1 ]
Arir, Areeba [1 ]
Khalid, Hassan Ali [1 ]
机构
[1] Gomal Univ, Inst Comp & Informat Technol, Dera Ismail Khan, Pakistan
关键词
Personality recognition; re-sampling; machine learning; XGBoost; class imbalanced; MBTI; social networks; SOCIAL MEDIA;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Personality refer to the distinctive set of characteristics of a person that effect their habits, behaviour's, attitude and pattern of thoughts. Text available on Social Networking sites provide an opportunity to recognize individual's personality traits automatically. In this proposed work, Machine Learning Technique, XGBoost classifier is used to predict four personality traits based on Myers- Briggs Type Indicator (MBTI) model, namely Introversion-Extroversion(I-E), iNtuition-Sensing(N-S), Feeling-Thinking(F-T) and Judging-Perceiving(J-P) from input text. Publically available benchmark dataset from Kaggle is used in experiments. The skewness of the dataset is the main issue associated with the prior work, which is minimized by applying Re-sampling technique namely random over-sampling, resulting in better performance. For more exploration of the personality from text, pre-processing techniques including tokenization, word stemming, stop words elimination and feature selection using TF IDF are also exploited. This work provides the basis for developing a personality identification system which could assist organization for recruiting and selecting appropriate personnel and to improve their business by knowing the personality and preferences of their customers. The results obtained by all classifiers across all personality traits is good enough, however, the performance of XGBoost classifier is outstanding by achieving more than 99% precision and accuracy for different traits.
引用
收藏
页码:460 / 476
页数:17
相关论文
共 50 条
  • [41] Machine learning for Asian language text classification
    Peng, Fuchun
    Huang, Xiangji
    [J]. JOURNAL OF DOCUMENTATION, 2007, 63 (03) : 378 - 397
  • [42] A machine learning approach to knowledge acquisitions from text databases
    Sakakibara, Y
    Misue, K
    Koshiba, T
    [J]. INTERNATIONAL JOURNAL OF HUMAN-COMPUTER INTERACTION, 1996, 8 (03) : 309 - 324
  • [43] Online classification of user activities using machine learning on network traffic
    Labayen, Victor
    Magana, Eduardo
    Morato, Daniel
    Izal, Mikel
    [J]. COMPUTER NETWORKS, 2020, 181
  • [44] Fruit Classification Using Traditional Machine Learning and Deep Learning Approach
    Saranya, N.
    Srinivasan, K.
    Kumar, S. K. Pravin
    Rukkumani, V
    Ramya, R.
    [J]. COMPUTATIONAL VISION AND BIO-INSPIRED COMPUTING, 2020, 1108 : 79 - 89
  • [45] ONLINE FAKE JOB ADVERTISEMENT RECOGNITION AND CLASSIFICATION USING MACHINE LEARNING
    Alandjani, Gasim Othman
    [J]. 3C TIC, 2022, 11 (01): : 251 - 266
  • [46] Empirical Study of Online News Classification Using Machine Learning Approaches
    Suleymanov, Umid
    Rustamov, Samir
    Zulfugarov, Murad
    Orujov, Orkhan
    Musayev, Nadir
    Alizade, Azar
    [J]. 2018 IEEE 12TH INTERNATIONAL CONFERENCE ON APPLICATION OF INFORMATION AND COMMUNICATION TECHNOLOGIES (AICT), 2018, : 152 - 157
  • [47] A Machine Learning Approach to Classification of Case Reports on Adverse Drug Reactions Using Text Mining of Expert Opinions
    Kim, Hyon Hee
    Rhew, Ki Yon
    [J]. ADVANCES IN COMPUTER SCIENCE AND UBIQUITOUS COMPUTING, 2018, 474 : 1072 - 1077
  • [48] Text Classification in Clinical Practice Guidelines Using Machine-Learning Assisted Pattern-Based Approach
    Hussain, Musarrat
    Hussain, Jamil
    Ali, Taqdir
    Ali, Syed Imran
    Bilal, Hafiz Syed Muhammad
    Lee, Sungyoung
    Chung, Taechoong
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (08):
  • [49] A Feature-Aware Online Learning Approach for Support Vector Machine Classification
    Liu, Fang
    Lee, Kee Jin
    Hong, Jihoon
    [J]. 2018 15TH INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV), 2018, : 254 - 259
  • [50] Machine Learning Approach to Classification of Online Users by Exploiting Information Seeking Behavior
    Kanwal, Moona
    Khan, Muzammil Ahmad
    Ismat, Najma
    Khan, Najeed A.
    Khan, Aftab A.
    [J]. IEEE ACCESS, 2024, 12 : 53234 - 53249