Authorship Attribution of Arabic Tweets

Cited by: 0
Authors
Rabab'ah, Abdullateef [1]
Al-Ayyoub, Mahmoud [1]
Jararweh, Yaser [1]
Aldwairi, Monther [2]
Affiliations
[1] Jordan University of Science and Technology, Irbid, Jordan
[2] Zayed University, Dubai, United Arab Emirates
Keywords
Online Social Networks; Authorship Authentication; Stylometric Features; Bag-Of-Words; Identification
DOI
Not available
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
In tweet authentication, we are concerned with correctly attributing a tweet to its true author based on its textual content. The more general problem of authenticating long documents has been studied before, and the most common approach relies on the intuitive idea that each author has a unique style that can be captured using stylometric features (SF). Inspired by the success of modern automatic document classification, some researchers have followed the Bag-Of-Words (BOW) approach for authenticating long documents. In this work, we consider both approaches and their application to authenticating tweets, which pose additional challenges because of their limited length. We focus on the Arabic language due to its importance and the scarcity of work on it. We create different sets of features from both approaches and compare the performance of different classifiers using them. To the best of our knowledge, this is the first study of its kind to combine these different sets of features for authorship analysis of Arabic tweets. The results show that combining all the feature sets we compute yields the best results.
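The idea of combining stylometric and Bag-Of-Words features can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the abstract does not list the actual features, tokenization, or classifiers, so the four style features, the whitespace tokenizer, and the function names below are hypothetical and are not the authors' method.

```python
from collections import Counter

# Punctuation counted as a style signal (illustrative choice): ASCII marks plus
# the Arabic comma (U+060C), semicolon (U+061B), and question mark (U+061F).
PUNCTUATION = set(".,!?;:\u060C\u061B\u061F")


def stylometric_features(text):
    """Return a few hypothetical style features for one tweet."""
    tokens = text.split()  # naive whitespace tokenization (assumption)
    n_tokens = len(tokens)
    avg_word_len = sum(len(t) for t in tokens) / n_tokens if n_tokens else 0.0
    n_punct = sum(1 for ch in text if ch in PUNCTUATION)
    return [len(text), n_tokens, avg_word_len, n_punct]


def build_vocabulary(texts):
    """Map every distinct token in the corpus to a column index."""
    vocab = sorted({tok for text in texts for tok in text.split()})
    return {tok: i for i, tok in enumerate(vocab)}


def bow_features(text, vocab):
    """Return token counts for one tweet, aligned to the vocabulary."""
    counts = Counter(text.split())
    vector = [0] * len(vocab)
    for tok, count in counts.items():
        if tok in vocab:  # tokens unseen at vocabulary-building time are ignored
            vector[vocab[tok]] = count
    return vector


def combined_features(text, vocab):
    """Concatenate the style features and the BOW counts into one vector."""
    return stylometric_features(text) + bow_features(text, vocab)


if __name__ == "__main__":
    tweets = ["hello world!", "hello hello my friend"]  # placeholder texts
    vocab = build_vocabulary(tweets)
    for tweet in tweets:
        print(combined_features(tweet, vocab))
```

Each resulting vector could then be passed to any standard classifier, so that the two feature families can be compared separately and in combination.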
Pages: 6