Text-Based Gender Classification of Twitter Data using Naive Bayes and SVM Algorithm

Cited by: 0
Authors
Angeles, Angelic [1 ]
Quintos, Maria Nikki [1 ]
Octaviano, Manolito, Jr. [1 ]
Raga, Rodolofo, Jr. [2 ]
Affiliations
[1] Natl Univ, Dept Comp Sci, Manila, Philippines
[2] Jose Rizal Univ, Dept Comp Sci, Mandaluyong, Philippines
Keywords
machine learning; classification; gender; social media; Twitter; tweets; extraction; meta-attributes; Naive Bayes; Multinomial Naive Bayes; Support Vector Machine (SVM)
DOI
10.1109/TENCON54134.2021.9707402
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents the development of a gender classification system for Twitter tweets. Three feature extraction techniques are explored: bag of words and two variations of meta-attribute extraction. The feature sets are fed to Multinomial Naive Bayes and Support Vector Machine (SVM) classifiers, and the results are compared to determine which algorithm performs best on the classification task. Experiments show that the SVM outperformed the Naive Bayes algorithm, obtaining a performance of 56.31%.
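As a rough illustration of the bag-of-words plus Multinomial Naive Bayes part of the pipeline the abstract describes, the following is a minimal sketch in plain Python. It is not the authors' implementation: the whitespace tokenizer, the Laplace smoothing value, the class name `ToyMultinomialNB`, and the toy texts and placeholder labels "a" and "b" are all assumptions made for this example. The meta-attribute features and the SVM classifier are not covered.

```python
import math
from collections import Counter, defaultdict


def bag_of_words(text):
    """Lowercase the text, split on whitespace, and count each token."""
    return Counter(text.lower().split())


class ToyMultinomialNB:
    """Hypothetical minimal Multinomial Naive Bayes over bag-of-words counts."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha  # Laplace smoothing constant (assumed value)

    def fit(self, texts, labels):
        self.class_counts = Counter(labels)       # documents per class
        self.token_counts = defaultdict(Counter)  # token counts per class
        for text, label in zip(texts, labels):
            self.token_counts[label].update(bag_of_words(text))
        self.vocab = {t for c in self.token_counts.values() for t in c}
        return self

    def predict(self, text):
        n_docs = sum(self.class_counts.values())
        features = bag_of_words(text)
        best_label, best_score = None, -math.inf
        for label, n_label in self.class_counts.items():
            counts = self.token_counts[label]
            denom = sum(counts.values()) + self.alpha * len(self.vocab)
            # Log prior plus the sum of smoothed log likelihoods.
            score = math.log(n_label / n_docs)
            for token, freq in features.items():
                if token not in self.vocab:
                    continue  # tokens never seen in training are ignored
                score += freq * math.log((counts[token] + self.alpha) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Made-up toy data; "a" and "b" stand in for the two classes.
train_texts = ["tea tea scone", "tea biscuit", "coffee bagel", "coffee coffee donut"]
train_labels = ["a", "a", "b", "b"]

model = ToyMultinomialNB().fit(train_texts, train_labels)
print(model.predict("tea scone"))
```

In practice a library implementation would likely replace this hand-rolled class; the sketch only shows how token counts feed the per-class log-probability comparison.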
Pages: 522 - 526
Page count: 5
Related papers
50 items in total
  • [1] Personality Classification Based on Twitter Text Using Naive Bayes, KNN and SVM
    Pratama, Bayu Yudha
    Sarno, Riyanarto
    [J]. 2015 INTERNATIONAL CONFERENCE ON DATA AND SOFTWARE ENGINEERING (ICODSE), 2015, : 170 - 174
  • [2] Text-based Language Identifier using Multinomial Naive Bayes Algorithm
    Rawat, Sunita
    Werulkar, Lakshita
    Jaywant, Sagarika
    [J]. INTERNATIONAL JOURNAL OF NEXT-GENERATION COMPUTING, 2023, 14 (01): : 96 - 102
  • [3] Using Naive Bayes Method to Classify Text-based Email
    Kang, LanLan
    Chen, Ruey-Shun
    Chen, Yeh-Cheng
    Cao, WenLiang
    [J]. 2018 9TH INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES, ALGORITHMS AND PROGRAMMING (PAAP 2018), 2018, : 94 - 98
  • [4] Gender Classification using Twitter Text Data
    Vashisth, Pradeep
    Meehan, Kevin
    [J]. 2020 31ST IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2020, : 56 - 61
  • [5] Text Classification Based on Naive Bayes Algorithm with Feature Selection
    Chen, Zhenguo
    Shi, Guang
    Wang, Xiaoju
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2012, 15 (10): : 4255 - 4260
  • [6] A Chinese text classification system based on Naive Bayes algorithm
    Cui, Wei
    [J]. 2016 INTERNATIONAL CONFERENCE ON ELECTRONIC, INFORMATION AND COMPUTER ENGINEERING, 2016, 44
  • [7] A Voice Activity Detector using SVM and Naive Bayes Classification Algorithm
    Selvakumari, N. A. Sheela
    Radha, V.
    [J]. PROCEEDINGS OF 2017 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND COMMUNICATION (ICSPC'17), 2017, : 1 - 6
  • [8] Comparison of SVM and Naive Bayes for Sentiment Classification using BERT data
    Rana, Shivani
    Kanji, Rakesh
    Jain, Shruti
    [J]. 2022 5TH INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2022,
  • [9] Chinese News Text Multi Classification Based on Naive Bayes Algorithm
    Wang, Fei
    Deng, Xin
    Hou, Lunqing
    [J]. ISCSIC'18: PROCEEDINGS OF THE 2ND INTERNATIONAL SYMPOSIUM ON COMPUTER SCIENCE AND INTELLIGENT CONTROL, 2018,
  • [10] Classify Text-based Email Using Naive Bayes Method With Small Sample
    Zhu, Yanjun
    Zhu, Ting
    Li, Jianxin
    Cao, Wenliang
    Yong, Peng
    Jiang, Fei
    Liu, Jie
    [J]. JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2023, 39 (04) : 855 - 868