How Machine Learning Classification Accuracy Changes in a Happiness Dataset with Different Demographic Groups

被引：5

作者：

Sweeney, Colm ^{[1
]}

Ennis, Edel ^{[1
]}

Mulvenna, Maurice ^{[2
]}

Bond, Raymond ^{[2
]}

O'Neill, Siobhan ^{[1
]}

机构：

[1] Ulster Univ, Sch Psychol, Coleraine BT52 1SA, Londonderry, North Ireland

[2] Ulster Univ, Sch Comp, Jordanstown BT37 0QB, North Ireland

来源：

COMPUTERS | 2022年 / 11卷 / 05期

关键词：

machine learning; classification; positive psychology; GENDER;

D O I：

10.3390/computers11050083

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

This study aims to explore how machine learning classification accuracy changes with different demographic groups. The HappyDB is a dataset that contains over 100,000 happy statements, incorporating demographic information that includes marital status, gender, age, and parenthood status. Using the happiness category field, we test different types of machine learning classifiers to predict what category of happiness the statements belong to, for example, whether they indicate happiness relating to achievement or affection. The tests were initially conducted with three distinct classifiers and the best performing model was the convolutional neural network (CNN) model, which is a deep learning algorithm, achieving an F1 score of 0.897 when used with the complete dataset. This model was then used as the main classifier to further analyze the results and to establish any variety in performance when tested on different demographic groups. We analyzed the results to see if classification accuracy was improved for different demographic groups, and found that the accuracy of prediction within this dataset declined with age, with the exception of the single parent subgroup. The results also showed improved performance for the married and parent subgroups, and lower performances for the non-parent and un-married subgroups, even when investigating a balanced sample.

引用

页数：15

共 50 条

[41] Comparison of the prediction accuracy of machine learning algorithms in crosslinguistic vowel classification
Georgiou, Georgios P.
SCIENTIFIC REPORTS, 2023, 13 (01):
[42] Improving Classification Accuracy of a Machine Learning approach for FPGA Timing Closure
Que Yanghua
Kapre, Nachiket
Ng, Harnhua
Teo, Kirvy
2016 IEEE 24TH ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2016, : 80 - 83
[43] Enhancing Accuracy of Arrhythmia Classification by Combining Logical and Machine Learning Techniques
Kalidas, Vignesh
Tamil, Lakshman S.
2015 COMPUTING IN CARDIOLOGY CONFERENCE (CINC), 2015, 42 : 733 - 736
[44] Comparison of the prediction accuracy of machine learning algorithms in crosslinguistic vowel classification
Georgios P. Georgiou
Scientific Reports, 13 (1)
[45] Uncertainty as a Predictor of Classification Accuracy in Machine Learning-Assisted Measurements
Shirmohammadi, Shervin
Amiri, Mohammad Hadi
Al Osman, Hussein
IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE, 2024, 27 (07) : 37 - 45
[46] Differential Beat Accuracy for ECG Family Classification Using Machine Learning
Vadillo-Valderrama, Alba
Goya-Esteban, Rebeca
Caulier-Cisterna, Raul P.
Garcia-Alberola, Arcadi
Rojo-Alvarez, Jose Luis
IEEE ACCESS, 2022, 10 : 129362 - 129381
[47] Classification model for accuracy and intrusion detection using machine learning approach
Agarwal, Arushi
Sharma, Purushottam
Alshehri, Mohammed
Mohamed, Ahmed A.
Alfarraj, Osama
PEERJ COMPUTER SCIENCE, 2021,
[48] Text Classification: How Machine Learning Is Revolutionizing Text Categorization
Allam, Hesham
Makubvure, Lisa
Gyamfi, Benjamin
Graham, Kwadwo Nyarko
Akinwolere, Kehinde
INFORMATION, 2025, 16 (02)
[49] Optimal Kernel Extreme Learning Machine for COVID-19 Classification on Epidemiology Dataset
Alotaibi, Saud S.
Al-Rasheed, Amal
Althahabi, Sami
Hamza, Manar Ahmed
Mohamed, Abdullah
Zamani, Abu Sarwar
Motwakel, Abdelwahed
Eldesouki, Mohamed, I
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 73 (02): : 3305 - 3318
[50] Improving Classification Using Preprocessing and Machine Learning Algorithms on NSL-KDD Dataset
Deshmukh, Datta H.
Ghorpade, Tushar
Padiya, Puja
2015 INTERNATIONAL CONFERENCE ON COMMUNICATION, INFORMATION & COMPUTING TECHNOLOGY (ICCICT), 2015,

← 1 2 3 4 5 →