CoLink: An Unsupervised Framework for User Identity Linkage

Cited by: 0
Authors
Zhong, Zexuan [1]
Cao, Yong [2]
Guo, Mu [2]
Nie, Zaiqing [3]
Affiliations
[1] Microsoft Res, Beijing, Peoples R China
[2] Univ Illinois, Urbana, IL USA
[3] Alibaba AI Labs, Hangzhou, Zhejiang, Peoples R China
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Nowadays, it is very common for one person to be in different social networks. Linking identical users across different social networks, also known as the User Identity Linkage (UIL) problem, is fundamental for many applications. There are two major challenges in the UIL problem. First, it's extremely expensive to collect manually linked user pairs as training data. Second, the user attributes in different networks are usually defined and formatted very differently, which makes attribute alignment very hard. In this paper we propose CoLink, a general unsupervised framework for the UIL problem. CoLink employs a co-training algorithm, which manipulates two independent models, the attribute-based model and the relationship-based model, and makes them reinforce each other iteratively in an unsupervised way. We also propose the sequence-to-sequence learning as a very effective implementation of the attribute-based model, which can well handle the challenge of the attribute alignment by treating it as a machine translation problem. We apply CoLink to a UIL task of mapping the employees in an enterprise network to their LinkedIn profiles. The experiment results show that CoLink generally outperforms the state-of-the-art unsupervised approaches by an F1 increase over 20%.
Pages: 5714-5721
Page count: 8
Related papers
50 in total
  • [1] Retrofitting Embeddings for Unsupervised User Identity Linkage
    Zhou, Tao
    Lim, Ee-Peng
    Lee, Roy Ka-Wei
    Zhu, Feida
    Cao, Jiuxin
    [J]. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2020, PT I, 2020, 12084 : 385 - 397
  • [2] Unsupervised User Identity Linkage via Factoid Embedding
    Xie, Wei
    Mu, Xin
    Lee, Roy Ka-Wei
    Zhu, Feida
    Lim, Ee-Peng
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1338 - 1343
  • [3] Distribution Distance Minimization for Unsupervised User Identity Linkage
    Li, Chaozhuo
    Wang, Senzhang
    Yu, Philip S.
    Zheng, Lei
    Zhang, Xiaoming
    Li, Zhoujun
    Liang, Yanbo
    [J]. CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 447 - 456
  • [4] Unsupervised User Identity Linkage via Graph Neural Networks
    Zhou, Fan
    Wen, Zijing
    Zhong, Ting
    Trajcevski, Goce
    Xu, Xovee
    Liu, Leyuan
    [J]. 2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [5] A Novel Framework with Information Fusion and Neighborhood Enhancement for User Identity Linkage
    Chen, Siyuan
    Wang, Jiahai
    Du, Xin
    Hu, Yanqing
    [J]. ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1754 - 1761
  • [6] User Identity Linkage by Latent User Space Modelling
    Mu, Xin
    Zhu, Feida
    Lim, Ee-Peng
    Xiao, Jing
    Wang, Jianzong
    Zhou, Zhi-Hua
    [J]. KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, : 1775 - 1784
  • [7] DeLink: An Adversarial Framework for Defending against Cross-site User Identity Linkage
    Zhang, Peng
    Zhou, Qi
    Lu, Tun
    Gu, Hansu
    Gu, Ning
    [J]. ACM TRANSACTIONS ON THE WEB, 2024, 18 (02)
  • [8] Anchor User Oriented Accordant Embedding for User Identity Linkage
    Li, Xiang
    Su, Yijun
    Gao, Neng
    Tang, Wei
    Xiang, Ji
    Wang, Yuewu
    [J]. NEURAL INFORMATION PROCESSING, ICONIP 2019, PT V, 2019, 1143 : 561 - 572
  • [9] A Semi-supervised Framework with Efficient Feature Extraction and Network Alignment for User Identity Linkage
    Hu, Zehua
    Wang, Jiahai
    Chen, Siyuan
    Du, Xin
    [J]. DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT II, 2021, 12682 : 675 - 691
  • [10] Hubness-aware User Identity Linkage
    Li, Chaozhuo
    Wang, Senzhang
    Huang, Feiran
    Xu, Jie
    Yu, Philip
    [J]. PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 3196 - 3200