Learning User Embedding Representation for Gender Prediction

被引:0
|
作者
Chen, Li [1 ]
Qian, Tieyun [1 ]
Zhu, Peisong [1 ]
You, Zhenni [1 ]
机构
[1] Wuhan Univ, State Key Lab Software Engn, Wuhan, Peoples R China
关键词
gender prediction; user embedding; user representation;
D O I
10.1109/ICTAI.2016.45
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Predicting the gender of users in social media has aroused great interests in recent years. Almost all existing studies rely on the the content features extracted from the main texts like tweets or reviews. It is sometimes difficult to extract content information since many users do not write any posts at all. In this paper, we present a novel framework which uses only the users' ids and their social contexts for gender prediction. The key idea is to represent users in the embedding connection space. A user often has the social context of family members, schoolmates, colleagues, and friends. This is similar to a word and its contexts in documents, which motivates our study. However, when modifying the word embedding technique for user embedding, there are two major challenges. First, unlike the syntax in language, no rule is responsible for the composition of the social contexts. Second, new users were not seen when learning the representations and thus they do not have embedding vectors. Two strategies circular ordering and incremental updating are proposed to solve these problems. We evaluate our methodology on two real data sets. Experimental results demonstrate that our proposed approach is significantly better than the traditional graph representation and the state-of-the-art graph embedding baselines. It also outperforms the content based approaches by a large margin.
引用
收藏
页码:263 / 269
页数:7
相关论文
共 50 条
  • [21] SSE: Semantic Sentence Embedding for learning user interactions
    Lilian, J. Felicia
    Sundarakantham, K.
    Rajashree, Harine
    Shalinie, S. Mercy
    2019 10TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2019,
  • [22] Recursive RNN Based Shift Representation Learning for Dynamic User-Item Interaction Prediction
    Yin, Chengyu
    Wang, Senzhang
    Du, Jinlong
    Zhang, Meiyue
    ADVANCED DATA MINING AND APPLICATIONS, 2020, 12447 : 379 - 394
  • [23] Learning Job Representation Using Directed Graph Embedding
    Luo, Haiyan
    Ma, Shichuan
    Selvaraj, Anand Joseph Bernard
    Sun, Yu
    1ST INTERNATIONAL WORKSHOP ON DEEP LEARNING PRACTICE FOR HIGH-DIMENSIONAL SPARSE DATA WITH KDD (DLP-KDD 2019), 2019,
  • [24] Table representation learning using heterogeneous graph embedding
    Tchuitcheu, Willy Carlos
    Lu, Tan
    Dooms, Ann
    PATTERN RECOGNITION, 2024, 156
  • [25] Learning shared embedding representation of motion and text using contrastive learning
    Horie, Junpei
    Noguchi, Wataru
    Iizuka, Hiroyuki
    Yamamoto, Masahito
    ARTIFICIAL LIFE AND ROBOTICS, 2023, 28 (01) : 148 - 157
  • [26] Learning shared embedding representation of motion and text using contrastive learning
    Junpei Horie
    Wataru Noguchi
    Hiroyuki Iizuka
    Masahito Yamamoto
    Artificial Life and Robotics, 2023, 28 : 148 - 157
  • [27] HerGePred: Heterogeneous Network Embedding Representation for Disease Gene Prediction
    Yang, Kuo
    Wang, Ruyu
    Liu, Guangming
    Shu, Zixin
    Wang, Ning
    Zhang, Runshun
    Yu, Jian
    Chen, Jianxin
    Li, Xiaodong
    Zhou, Xuezhong
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2019, 23 (04) : 1805 - 1815
  • [28] Knowledge graph embedding with inverse function representation for link prediction
    Zhang, Qianjin
    Xu, Yandan
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [29] LGTRL-DE: Local and Global Temporal Representation Learning with Demographic Embedding for in-hospital mortality prediction
    Zou, Mengjie
    An, Ying
    Kuang, Hulin
    Wang, Jianxin
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 143
  • [30] User preference and embedding learning with implicit feedback for recommender systems
    Sidana, Sumit
    Trofimov, Mikhail
    Horodnytskyi, Oleh
    Laclau, Charlotte
    Maximov, Yury
    Amini, Massih-Reza
    DATA MINING AND KNOWLEDGE DISCOVERY, 2021, 35 (02) : 568 - 592