Authorship Attribution of Arabic Tweets

被引:0
|
作者
Rabab'ah, Abdullateef [1 ]
Al-Ayyoub, Mahmoud [1 ]
Jararweh, Yaser [1 ]
Aldwairi, Monther [2 ]
机构
[1] Jordan Univ Sci & Technol, Irbid, Jordan
[2] Zayed Univ, Dubai, U Arab Emirates
关键词
Online Social Networks; Authorship Authentication; Stylometric Features; Bag-Of-Words; IDENTIFICATION;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In tweet authentication, we are concerned with correctly attributing a tweet to its true author based on its textual content. The more general problem of authenticating long documents has been studied before and the most common approach relies on the intuitive idea that each author has a unique style that can be captured using stylometric features (SF). Inspired by the success of modern automatic document classification problem, some researchers followed the Bag-Of-Words (BOW) approach for authenticating long documents. In this work, we consider both approaches and their application on authenticating tweets, which represent additional challenges due to the limitation in their sizes. We focus on the Arabic language due to its importance and the scarcity of works related on it. We create different sets of features from both approaches and compare the performance of different classifiers using them. To the best of our knowledge, this is the first study of its kind to combine these different sets of features for authorship analysis of Arabic tweets. The results show that combining all the feature sets we compute yields the best results.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Authorship Attribution of Arabic Articles
    Hajja, Maha
    Yahya, Ahmad
    Yahya, Adnan
    [J]. ARABIC LANGUAGE PROCESSING: FROM THEORY TO PRACTICE, ICALP 2019, 2019, 1108 : 194 - 208
  • [2] Authorship Attribution in Arabic Poetry
    Ahmed, Alfalahi
    Mohamed, Ramdani
    Mostafa, Bellafkih
    [J]. 2016 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA), 2016,
  • [3] The Effectiveness of Stemming in the Stylometric Authorship Attribution in Arabic
    Omar, Abdulfattah
    Hamouda, Wafya Ibrahim
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (01) : 116 - 121
  • [4] Feature extraction and selection for Arabic tweets authorship authentication
    Mahmoud Al-Ayyoub
    Yaser Jararweh
    Abdullateef Rabab’ah
    Monther Aldwairi
    [J]. Journal of Ambient Intelligence and Humanized Computing, 2017, 8 : 383 - 393
  • [5] Feature extraction and selection for Arabic tweets authorship authentication
    Al-Ayyoub, Mahmoud
    Jararweh, Yaser
    Rabab'ah, Abdullateef
    Aldwairi, Monther
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2017, 8 (03) : 383 - 393
  • [6] Investigating Predictive Features for Authorship Verification of Arabic Tweets
    Alqahtani, Fatimah
    Dohler, Mischa
    [J]. INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2022, 22 (06): : 115 - 126
  • [7] Arabic Authorship Attribution: An Extensive Study on Twitter Posts
    Altakrori, Malik H.
    Iqbal, Farkhund
    Fung, Benjamin C. M.
    Ding, Steven H. H.
    Tubaishat, Abdallah
    [J]. ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2019, 18 (01)
  • [8] Naive Bayes classifiers for authorship attribution of Arabic texts
    Altheneyan, Alaa Saleh
    Menai, Mohamed El Bachir
    [J]. JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2014, 26 (04) : 473 - 484
  • [9] A Comparative Survey of Authorship Attribution on Short Arabic Texts
    Ouamour, Siham
    Sayoud, Halim
    [J]. SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 479 - 489
  • [10] Using Big Data Analytics For Authorship Authentication of Arabic Tweets
    Albadarneh, Jafar
    Talafha, Bashar
    Al-Ayyoub, Mahmoud
    Zaqaibeh, Belal
    Al-Smadi, Mohammad
    Jararweh, Yaser
    Benkhelifa, Elhadj
    [J]. 2015 IEEE/ACM 8TH INTERNATIONAL CONFERENCE ON UTILITY AND CLOUD COMPUTING (UCC), 2015, : 448 - 452