Language Independent Gender Classification on Twitter

被引:0
|
作者
Alowibdi, Jalal S. [1 ,2 ]
Buy, Ugo A. [1 ]
Yu, Philip [1 ,2 ]
机构
[1] Univ Illinois, Dept Comp Sci, Chicago, IL 60680 USA
[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Jeddah, Saudi Arabia
关键词
Color-based Features; Social Network; Application for Social Network; Language Independent;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Online Social Networks (OSNs) generate a huge volume of user-originated texts. Gender classification can serve multiple purposes. For example, commercial organizations can use gender classification for advertising. Law enforcement may use gender classification as part of legal investigations. Others may use gender information for social reasons. Here we explore language independent gender classification. Our approach predicts gender using five color-based features extracted from Twitter profiles (e.g., the background color in a user's profile page). Most other methods for gender prediction are typically language dependent. Those methods use high-dimensional spaces consisting of unique words extracted from such text fields as postings, user names, and profile descriptions. Our approach is independent of the user's language, efficient, and scalable, while attaining a good level of accuracy. We prove the validity of our approach by examining different classifiers over a large dataset of Twitter profiles.
引用
收藏
页码:745 / 749
页数:5
相关论文
共 50 条
  • [1] Gender-inclusive Language in Twitter
    Samples, Caitlin E.
    [J]. HISPANIA-A JOURNAL DEVOTED TO THE TEACHING OF SPANISH AND PORTUGUESE, 2024, 107 (01): : 139 - 160
  • [2] Language-Independent Twitter Classification Using Character-Based Convolutional Networks
    Zhang, Shiwei
    Zhang, Xiuzhen
    Chan, Jeffrey
    [J]. ADVANCED DATA MINING AND APPLICATIONS, ADMA 2017, 2017, 10604 : 413 - 425
  • [3] Arabic Offensive Language Classification on Twitter
    Mubarak, Hamdy
    Darwish, Kareem
    [J]. SOCIAL INFORMATICS, SOCINFO 2019, 2019, 11864 : 269 - 276
  • [4] Gender Classification using Twitter Text Data
    Vashisth, Pradeep
    Meehan, Kevin
    [J]. 2020 31ST IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2020, : 56 - 61
  • [5] Automatic Identification and Classification of Misogynistic Language on Twitter
    Anzovino, Maria
    Fersini, Elisabetta
    Rosso, Paolo
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2018), 2018, 10859 : 57 - 64
  • [6] Language independent gender identification
    Parris, ES
    Carey, MJ
    [J]. 1996 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, CONFERENCE PROCEEDINGS, VOLS 1-6, 1996, : 685 - 688
  • [7] THE CLASSIFICATION OF GENDER IN THE CREE LANGUAGE
    VAILLANCOURT, LP
    [J]. ANTHROPOLOGICA, 1982, 24 (02) : 207 - 214
  • [8] Twitter gender classification using user unstructured information
    Vicente, Marco
    Batista, Fernando
    Carvalho, Joao Paulo
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE 2015), 2015,
  • [9] Empirical Evaluation of Profile Characteristics for Gender Classification on Twitter
    Alowibdi, Jalal S.
    Buy, Ugo A.
    Yu, Philip
    [J]. 2013 12TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2013), VOL 1, 2013, : 365 - 369
  • [10] Language and Culture Effects on Gender Classification of Objects
    Nicoladis, Elena
    Foursha-Stevenson, Cassandra
    [J]. JOURNAL OF CROSS-CULTURAL PSYCHOLOGY, 2012, 43 (07) : 1095 - 1109