Personality Classification from Online Text using Machine Learning Approach

被引:0
|
作者
Khan, Alam Sher [1 ]
Ahmad, Hussain [1 ]
Asghar, Muhammad Zubair [1 ]
Saddozai, Furcian Khan [1 ]
Arir, Areeba [1 ]
Khalid, Hassan Ali [1 ]
机构
[1] Gomal Univ, Inst Comp & Informat Technol, Dera Ismail Khan, Pakistan
关键词
Personality recognition; re-sampling; machine learning; XGBoost; class imbalanced; MBTI; social networks; SOCIAL MEDIA;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Personality refer to the distinctive set of characteristics of a person that effect their habits, behaviour's, attitude and pattern of thoughts. Text available on Social Networking sites provide an opportunity to recognize individual's personality traits automatically. In this proposed work, Machine Learning Technique, XGBoost classifier is used to predict four personality traits based on Myers- Briggs Type Indicator (MBTI) model, namely Introversion-Extroversion(I-E), iNtuition-Sensing(N-S), Feeling-Thinking(F-T) and Judging-Perceiving(J-P) from input text. Publically available benchmark dataset from Kaggle is used in experiments. The skewness of the dataset is the main issue associated with the prior work, which is minimized by applying Re-sampling technique namely random over-sampling, resulting in better performance. For more exploration of the personality from text, pre-processing techniques including tokenization, word stemming, stop words elimination and feature selection using TF IDF are also exploited. This work provides the basis for developing a personality identification system which could assist organization for recruiting and selecting appropriate personnel and to improve their business by knowing the personality and preferences of their customers. The results obtained by all classifiers across all personality traits is good enough, however, the performance of XGBoost classifier is outstanding by achieving more than 99% precision and accuracy for different traits.
引用
收藏
页码:460 / 476
页数:17
相关论文
共 50 条
  • [21] Feature Selection for Text Classification Using Machine Learning Approaches
    K. Thirumoorthy
    K. Muneeswaran
    [J]. National Academy Science Letters, 2022, 45 : 51 - 56
  • [22] Text Classification Using Machine Learning Methods-A Survey
    Agarwal, Basant
    Mittal, Namita
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2012), 2014, 236 : 701 - 709
  • [23] Automatic text summarization using a machine learning approach
    Neto, JL
    Freitas, AA
    Kaestner, CAA
    [J]. ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, 2507 : 205 - 215
  • [24] Poem Classification Using Machine Learning Approach
    Kumar, Vipin
    Minz, Sonajharia
    [J]. PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON SOFT COMPUTING FOR PROBLEM SOLVING (SOCPROS 2012), 2014, 236 : 675 - 682
  • [25] A Hybrid Deep Learning Technique for Personality Trait Classification From Text
    Ahmad, Hussain
    Asghar, Muhammad Usama
    Asghar, Muhammad Zubair
    Khan, Aurangzeb
    Mosavi, Amir H.
    [J]. IEEE ACCESS, 2021, 9 : 146214 - 146232
  • [26] Classification of Online Toxic Comments Using Machine Learning Algorithms
    Rahul
    Kajla, Harsh
    Hooda, Jatin
    Saini, Gajanand
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS 2020), 2020, : 1119 - 1123
  • [27] Novel Machine Learning-Based Approach for Arabic Text Classification Using Stylistic and Semantic Features
    Fkih, Fethi
    Alsuhaibani, Mohammed
    Rhouma, Delel
    Qamar, Ali Mustafa
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (03): : 5871 - 5886
  • [28] Turkish Text Classification with Machine Learning and Transfer Learning
    Aydogan, Murat
    Karci, Ali
    [J]. 2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND DATA PROCESSING (IDAP 2019), 2019,
  • [29] Efficient English text classification using selected Machine Learning Techniques
    Luo, Xiaoyu
    [J]. ALEXANDRIA ENGINEERING JOURNAL, 2021, 60 (03) : 3401 - 3409
  • [30] Classification of emotional tone of dreams using machine learning and text analyses
    Razavi, A.
    Amini, R.
    Sabourin, C.
    Shirabad, Sayyad Y.
    Nadeau, D.
    De Koninck, J.
    Matwin, S.
    [J]. SLEEP, 2008, 31 : A380 - A381