How Machine Learning Classification Accuracy Changes in a Happiness Dataset with Different Demographic Groups

被引:5
|
作者
Sweeney, Colm [1 ]
Ennis, Edel [1 ]
Mulvenna, Maurice [2 ]
Bond, Raymond [2 ]
O'Neill, Siobhan [1 ]
机构
[1] Ulster Univ, Sch Psychol, Coleraine BT52 1SA, Londonderry, North Ireland
[2] Ulster Univ, Sch Comp, Jordanstown BT37 0QB, North Ireland
关键词
machine learning; classification; positive psychology; GENDER;
D O I
10.3390/computers11050083
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This study aims to explore how machine learning classification accuracy changes with different demographic groups. The HappyDB is a dataset that contains over 100,000 happy statements, incorporating demographic information that includes marital status, gender, age, and parenthood status. Using the happiness category field, we test different types of machine learning classifiers to predict what category of happiness the statements belong to, for example, whether they indicate happiness relating to achievement or affection. The tests were initially conducted with three distinct classifiers and the best performing model was the convolutional neural network (CNN) model, which is a deep learning algorithm, achieving an F1 score of 0.897 when used with the complete dataset. This model was then used as the main classifier to further analyze the results and to establish any variety in performance when tested on different demographic groups. We analyzed the results to see if classification accuracy was improved for different demographic groups, and found that the accuracy of prediction within this dataset declined with age, with the exception of the single parent subgroup. The results also showed improved performance for the married and parent subgroups, and lower performances for the non-parent and un-married subgroups, even when investigating a balanced sample.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Comparison of the prediction accuracy of machine learning algorithms in crosslinguistic vowel classification
    Georgiou, Georgios P.
    SCIENTIFIC REPORTS, 2023, 13 (01):
  • [42] Improving Classification Accuracy of a Machine Learning approach for FPGA Timing Closure
    Que Yanghua
    Kapre, Nachiket
    Ng, Harnhua
    Teo, Kirvy
    2016 IEEE 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2016, : 80 - 83
  • [43] Enhancing Accuracy of Arrhythmia Classification by Combining Logical and Machine Learning Techniques
    Kalidas, Vignesh
    Tamil, Lakshman S.
    2015 COMPUTING IN CARDIOLOGY CONFERENCE (CINC), 2015, 42 : 733 - 736
  • [44] Comparison of the prediction accuracy of machine learning algorithms in crosslinguistic vowel classification
    Georgios P. Georgiou
    Scientific Reports, 13 (1)
  • [45] Uncertainty as a Predictor of Classification Accuracy in Machine Learning-Assisted Measurements
    Shirmohammadi, Shervin
    Amiri, Mohammad Hadi
    Al Osman, Hussein
    IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE, 2024, 27 (07) : 37 - 45
  • [46] Differential Beat Accuracy for ECG Family Classification Using Machine Learning
    Vadillo-Valderrama, Alba
    Goya-Esteban, Rebeca
    Caulier-Cisterna, Raul P.
    Garcia-Alberola, Arcadi
    Rojo-Alvarez, Jose Luis
    IEEE ACCESS, 2022, 10 : 129362 - 129381
  • [47] Classification model for accuracy and intrusion detection using machine learning approach
    Agarwal, Arushi
    Sharma, Purushottam
    Alshehri, Mohammed
    Mohamed, Ahmed A.
    Alfarraj, Osama
    PEERJ COMPUTER SCIENCE, 2021,
  • [48] Text Classification: How Machine Learning Is Revolutionizing Text Categorization
    Allam, Hesham
    Makubvure, Lisa
    Gyamfi, Benjamin
    Graham, Kwadwo Nyarko
    Akinwolere, Kehinde
    INFORMATION, 2025, 16 (02)
  • [49] Optimal Kernel Extreme Learning Machine for COVID-19 Classification on Epidemiology Dataset
    Alotaibi, Saud S.
    Al-Rasheed, Amal
    Althahabi, Sami
    Hamza, Manar Ahmed
    Mohamed, Abdullah
    Zamani, Abu Sarwar
    Motwakel, Abdelwahed
    Eldesouki, Mohamed, I
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 3305 - 3318
  • [50] Improving Classification Using Preprocessing and Machine Learning Algorithms on NSL-KDD Dataset
    Deshmukh, Datta H.
    Ghorpade, Tushar
    Padiya, Puja
    2015 INTERNATIONAL CONFERENCE ON COMMUNICATION, INFORMATION & COMPUTING TECHNOLOGY (ICCICT), 2015,