Authorship Attribution of Arabic Tweets

被引:0
|
作者
Rabab'ah, Abdullateef [1 ]
Al-Ayyoub, Mahmoud [1 ]
Jararweh, Yaser [1 ]
Aldwairi, Monther [2 ]
机构
[1] Jordan Univ Sci & Technol, Irbid, Jordan
[2] Zayed Univ, Dubai, U Arab Emirates
关键词
Online Social Networks; Authorship Authentication; Stylometric Features; Bag-Of-Words; IDENTIFICATION;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In tweet authentication, we are concerned with correctly attributing a tweet to its true author based on its textual content. The more general problem of authenticating long documents has been studied before and the most common approach relies on the intuitive idea that each author has a unique style that can be captured using stylometric features (SF). Inspired by the success of modern automatic document classification problem, some researchers followed the Bag-Of-Words (BOW) approach for authenticating long documents. In this work, we consider both approaches and their application on authenticating tweets, which represent additional challenges due to the limitation in their sizes. We focus on the Arabic language due to its importance and the scarcity of works related on it. We create different sets of features from both approaches and compare the performance of different classifiers using them. To the best of our knowledge, this is the first study of its kind to combine these different sets of features for authorship analysis of Arabic tweets. The results show that combining all the feature sets we compute yields the best results.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] A New Approach for Authorship Attribution
    Reddy, P. Buddha
    Reddy, T. Raghunadha
    Chand, M. Gopi
    Venkannababu, A.
    [J]. INFORMATION AND DECISION SCIENCES, 2018, 701 : 1 - 9
  • [42] Estimating the Probability of an Authorship Attribution
    Savoy, Jacques
    [J]. JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2016, 67 (06) : 1462 - 1472
  • [43] THE REQUISITES OF UNIFORMITY AND THE ATTRIBUTION OR AUTHORSHIP
    PULIDO, M
    [J]. MEDICINA CLINICA, 1994, 103 (16): : 638 - 638
  • [44] Authorship Attribution of Android Apps
    Gonzalez, Hugo
    Stakhanova, Natalia
    Ghorbani, Ali A.
    [J]. PROCEEDINGS OF THE EIGHTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY (CODASPY'18), 2018, : 277 - 286
  • [45] Future trends in authorship attribution
    Juola, Patrick
    [J]. ADVANCES IN DIGITAL FORENSIC III, 2007, 242 : 119 - 132
  • [46] Authorship Attribution Using Entropy
    Grabchak, M.
    Zhang, Z.
    Zhang, D. T.
    [J]. JOURNAL OF QUANTITATIVE LINGUISTICS, 2013, 20 (04) : 301 - 313
  • [47] Authorship Attribution of Scientific Abstracts
    Suman, Chanchal
    Saha, Sriparna
    Bhattacharyya, Pushpak
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1522 - 1528
  • [48] Computational Methods in Authorship Attribution
    Koppel, Moshe
    Schler, Jonathan
    Argamon, Shlorno
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (01): : 9 - 26
  • [49] An example of mathematical authorship attribution
    Basile, Chiara
    Benedetto, Dario
    Caglioti, Emanuele
    Esposti, Mirko Degli
    [J]. JOURNAL OF MATHEMATICAL PHYSICS, 2008, 49 (12)
  • [50] Evaluating Author Attribution on Emirati Tweets
    Khonji, Mahmoud
    Iraqi, Youssef
    [J]. IEEE ACCESS, 2020, 8 (08): : 149531 - 149543