Multi-Domain Alias Matching Using Machine Learning

被引:0
|
作者
Ashcroft, Michael [1 ]
Johansson, Fredrik [2 ]
Kaati, Lisa [1 ,2 ]
Shrestha, Amendra [1 ]
机构
[1] Uppsala Univ, Dept Inf Technol, Uppsala, Sweden
[2] Swedish Def Res Agcy FOI, Stockholm, Sweden
关键词
D O I
10.1109/ENIC.2016.18
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We describe a methodology for linking aliases belonging to the same individual based on a user's writing style (stylometric features extracted from the user generated content) and her time patterns (time-based features extracted from the publishing times of the user generated content). While most previous research on social media identity linkage relies on matching usernames, our methodology can also be used for users who actively try to choose dissimilar usernames when creating their aliases. In our experiments on a discussion forum dataset and a Twitter dataset, we evaluate the performance of three different classifiers. We use the best classifier (AdaBoost) to evaluate how well it works on different datasets using different features. Experiments show that combining stylometric and time based features yield good results on our synthetic datasets and a small-scale evaluation on real-world blog data confirm these results, yielding a precision over 95%. The use of emotion-related and Twitter-related features yield no significant impact on the results.
引用
收藏
页码:77 / 84
页数:8
相关论文
共 50 条
  • [1] Predictive Distribution Matching SVM for Multi-domain Learning
    Seah, Chun-Wei
    Tsang, Ivor W.
    Ong, Yew-Soon
    Lee, Kee-Khoon
    [J]. MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I: EUROPEAN CONFERENCE, ECML PKDD 2010, 2010, 6321 : 231 - 247
  • [2] Extracting Relationships by Multi-Domain Matching
    Li, Yitong
    Murias, Michael
    Major, Samantha
    Dawson, Geraldine
    Carlson, David E.
    [J]. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [3] Combinatorial coverage framework for machine learning in multi-domain operations
    Cody, Tyler
    Kauffman, Justin
    Krometis, Justin
    Sobien, Dan
    Freeman, Laura
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS IV, 2022, 12113
  • [4] Federated Machine Learning for Multi-Domain Operations at the Tactical Edge
    Cirincione, Gregory
    Verma, Dinesh
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS, 2019, 11006
  • [5] Virtualized controller placement for multi-domain optical transport networks using machine learning
    Rahman, Sabidur
    Ahmed, Tanjila
    Ferdousi, Sifat
    Bhaumik, Partha
    Chowdhury, Pulak
    Tornatore, Massimo
    Das, Goutam
    Mukherjee, Biswanath
    [J]. PHOTONIC NETWORK COMMUNICATIONS, 2020, 40 (03) : 126 - 136
  • [6] Virtualized controller placement for multi-domain optical transport networks using machine learning
    Sabidur Rahman
    Tanjila Ahmed
    Sifat Ferdousi
    Partha Bhaumik
    Pulak Chowdhury
    Massimo Tornatore
    Goutam Das
    Biswanath Mukherjee
    [J]. Photonic Network Communications, 2020, 40 : 126 - 136
  • [7] Revisiting Multi-Domain Machine Translation
    MinhQuang Pham
    Crego, Josep Maria
    Yvon, Francois
    [J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2021, 9 : 17 - 35
  • [8] Federated Machine Learning for Resource Allocation in Multi-domain Fog Ecosystems
    Zhang, Weilin
    Toor, Salman
    Al-Naday, Mays
    [J]. 2023 IEEE 12TH INTERNATIONAL CONFERENCE ON CLOUD NETWORKING, CLOUDNET, 2023, : 290 - 298
  • [9] Secure and Trustworthy Machine Learning/Artificial Intelligence for Multi-Domain Operations
    Rawat, Danda
    [J]. ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS III, 2021, 11746
  • [10] An ensemble approach to stabilize the features for multi-domain sentiment analysis using supervised machine learning
    Ghosh M.
    Sanyal G.
    [J]. Ghosh, Monalisa (monalisa_05mca@yahoo.com), 2018, SpringerOpen (05)