Personality Traits Assessment: A Case of Study Using Text Mining Techniques

被引:0
|
作者
Sapino, Maximiliano [1 ,2 ,3 ]
Cagnina, Leticia [1 ,2 ,3 ]
Montenegro, Luis [1 ]
Ferretti, Edgardo [1 ,2 ]
机构
[1] Univ Nacl San Luis UNSL, San Luis, Argentina
[2] UNSL, Lab Invest & Desarrollo Inteligencia Computac, San Luis, Argentina
[3] Consejo Nacl Invest Cient & Tecn CONICET, San Luis, Argentina
来源
COMPUTER SCIENCE-CACIC 2023 | 2024年 / 2123卷
关键词
Personality Traits; Big Five Factors; Knowledge Discovery in Databases; Text Mining; Data Augmentation;
D O I
10.1007/978-3-031-62245-8_14
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
This paper presents a complete experience to solving a personality trait assessment problem using Text Mining techniques within the framework of Knowledge Discovery in Databases. The study involves collaboration between researchers from the fields of Computer Science and Psychology, highlighting the interdisciplinary nature of the work. In this study, four basic predictive algorithms were evaluated: Multinomial Naive Bayes, Logistic Regression, Support VectorMachines, and Decision Trees. These algorithms were applied to address the classification problem posed by personality trait assessment. Given the nature of the problem, where individuals may possess multiple personality traits to varying degrees, the classification task was modeled in three different ways: binary, multiclass, and multilabel. To enhance the performance of the classification approaches, a data augmentation techniquewas employed. The results indicate that data augmentation improves the performance of all classification approaches, with binary classification benefiting the most. Moreover, for three out of the five personality traits studied, the weighted-F1 scores exceed 0.75, indicating strong predictive accuracy. Notably, the Responsibility trait achieves the highest score of 0.88, demonstrating the effectiveness of the classification approach for this particular trait.
引用
下载
收藏
页码:197 / 212
页数:16
相关论文
共 50 条
  • [1] Using Text Mining Techniques to Identify Research Trends: A Case Study of Design Research
    Nie, Binling
    Sun, Shouqian
    APPLIED SCIENCES-BASEL, 2017, 7 (04):
  • [2] Online Healthcare Information Adoption Assessment Using Text Mining Techniques
    Prabha, M. Surya
    Sarojini, B.
    MOBILE NETWORKS & APPLICATIONS, 2019, 24 (04): : 1160 - 1165
  • [3] Online Healthcare Information Adoption Assessment Using Text Mining Techniques
    M. Surya Prabha
    B. Sarojini
    Mobile Networks and Applications, 2019, 24 : 1160 - 1165
  • [4] Assessment of building operational performance using data mining techniques: a case study
    Fan, Cheng
    Xiao, Fu
    8TH INTERNATIONAL CONFERENCE ON SUSTAINABILITY IN ENERGY AND BUILDINGS, SEB-16, 2017, 111 : 1070 - 1078
  • [5] Brand Personality Traits of World Heritage Sites: Text Mining Approach
    Hassan, Mohamed Abdalla Elsayed
    Zerva, Konstantina
    Aulet, Silvia
    SUSTAINABILITY, 2021, 13 (11)
  • [6] Using text mining techniques for identifying research gaps and priorities: a case study of the environmental science in Iran
    Rabiei, Mohammad
    Hosseini-Motlagh, Seyyed-Mahdi
    Haeri, Abdorrahman
    SCIENTOMETRICS, 2017, 110 (02) : 815 - 842
  • [7] Using text mining techniques for identifying research gaps and priorities: a case study of the environmental science in Iran
    Mohammad Rabiei
    Seyyed-Mahdi Hosseini-Motlagh
    Abdorrahman Haeri
    Scientometrics, 2017, 110 : 815 - 842
  • [8] A Comparative Study of Various Text Mining Techniques
    Tayal, Sandeep
    Goel, Sushil Kumar
    Sharma, Kapil
    2015 2ND INTERNATIONAL CONFERENCE ON COMPUTING FOR SUSTAINABLE GLOBAL DEVELOPMENT (INDIACOM), 2015, : 1637 - 1642
  • [9] Techniques on Text Mining
    Sukanya, M.
    Biruntha, S.
    2012 IEEE INTERNATIONAL CONFERENCE ON ADVANCED COMMUNICATION CONTROL AND COMPUTING TECHNOLOGIES (ICACCCT), 2012, : 269 - 271
  • [10] Text data mining: A case study
    Ford, CW
    Chiang, CC
    Wu, H
    Chilka, RR
    Talburt, JR
    ITCC 2005: INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY: CODING AND COMPUTING, VOL 1, 2005, : 122 - 127