Learning User Embedding Representation for Gender Prediction

Authors
Chen, Li [1]
Qian, Tieyun [1]
Zhu, Peisong [1]
You, Zhenni [1]
Affiliation
[1] Wuhan Univ, State Key Lab Software Engn, Wuhan, Peoples R China
Keywords
gender prediction; user embedding; user representation
DOI
10.1109/ICTAI.2016.45
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Predicting the gender of social media users has attracted great interest in recent years. Almost all existing studies rely on content features extracted from main texts such as tweets or reviews. Content information is sometimes difficult to extract, however, since many users do not write any posts at all. In this paper, we present a novel framework that uses only the users' ids and their social contexts for gender prediction. The key idea is to represent users in the embedding connection space. A user often has a social context of family members, schoolmates, colleagues, and friends. This is similar to a word and its contexts in documents, which motivates our study. However, adapting the word embedding technique to user embedding raises two major challenges. First, unlike the syntax of a language, no rule governs the composition of social contexts. Second, new users are not seen when the representations are learned, and thus they have no embedding vectors. Two strategies, circular ordering and incremental updating, are proposed to solve these problems. We evaluate our methodology on two real data sets. Experimental results demonstrate that the proposed approach is significantly better than the traditional graph representation and the state-of-the-art graph embedding baselines. It also outperforms the content-based approaches by a large margin.
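The abstract names the two strategies but does not define them, so the following is only an illustrative sketch under stated assumptions, not the authors' method. It assumes that "circular ordering" can be read as treating a user's social context as a ring, so that a word2vec-style window wraps around and no contact sits at an arbitrary boundary. For unseen users it swaps in a simple stand-in (averaging the vectors of already-embedded contacts) in place of the paper's incremental updating. The function names `circular_context_pairs` and `mean_of_known` are hypothetical.

```python
from typing import Dict, List, Tuple


def circular_context_pairs(context: List[str], window: int = 1) -> List[Tuple[str, str]]:
    """Build (center, neighbor) training pairs from one user's social context.

    ASSUMPTION: the context list is treated as a ring, so the window wraps
    around at both ends. This is one possible reading of "circular ordering".
    """
    n = len(context)
    pairs: List[Tuple[str, str]] = []
    for i, center in enumerate(context):
        # Collect wrapped neighbor positions; the set avoids duplicates
        # when the window is large relative to the context size.
        positions = {(i + d) % n for d in range(-window, window + 1)} - {i}
        for j in sorted(positions):
            pairs.append((center, context[j]))
    return pairs


def mean_of_known(embeddings: Dict[str, List[float]], contacts: List[str]) -> List[float]:
    """Initial vector for an unseen user: mean of embedded contacts' vectors.

    ASSUMPTION: a stand-in for illustration only, not the paper's
    incremental updating procedure.
    """
    known = [embeddings[c] for c in contacts if c in embeddings]
    if not known:
        raise ValueError("no embedded contacts available for this user")
    dim = len(known[0])
    return [sum(vec[k] for vec in known) / len(known) for k in range(dim)]


if __name__ == "__main__":
    print(circular_context_pairs(["a", "b", "c", "d"], window=1))
    print(mean_of_known({"a": [1.0, 0.0], "b": [3.0, 2.0]}, ["a", "b", "zz"]))
```

The resulting pairs could be fed to any skip-gram-style trainer. The wrap-around makes the first and last contacts neighbors of each other, which removes the dependence on where an unordered contact list happens to start.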
Pages: 263-269
Number of pages: 7