Recentred local profiles for authorship attribution

被引:17
|
作者
Layton, Robert [1 ]
Watters, Paul [1 ]
Dazeley, Richard [2 ]
机构
[1] Univ Ballarat, Internet Commerce Secur Lab, Ballarat, Vic 3353, Australia
[2] Univ Ballarat, Data Min & Informat Res Grp, Ballarat, Vic 3353, Australia
关键词
STYLE; IDENTIFICATION;
D O I
10.1017/S1351324911000180
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Authorship attribution methods aim to determine the author of a document, by using information gathered from a set of documents with known authors. One method of performing this task is to create profiles containing distinctive features known to be used by each author. In this paper, a new method of creating an author or document profile is presented that detects features considered distinctive, compared to normal language usage. This recentreing approach creates more accurate profiles than previous methods, as demonstrated empirically using a known corpus of authorship problems. This method, named recentred local profiles, determines authorship accurately using a simple 'best matching author' approach to classification, compared to other methods in the literature. The proposed method is shown to be more stable than related methods as parameter values change. Using a weighted voting scheme, recentred local profiles is shown to outperform other methods in authorship attribution, with an overall accuracy of 69.9% on the ad-hoc authorship attribution competition corpus, representing a significant improvement over related methods.
引用
收藏
页码:293 / 312
页数:20
相关论文
共 50 条
  • [21] Scalability Issues in Authorship Attribution
    Argamon, Shlomo
    [J]. LITERARY AND LINGUISTIC COMPUTING, 2012, 27 (01): : 95 - 97
  • [22] A New Approach for Authorship Attribution
    Reddy, P. Buddha
    Reddy, T. Raghunadha
    Chand, M. Gopi
    Venkannababu, A.
    [J]. INFORMATION AND DECISION SCIENCES, 2018, 701 : 1 - 9
  • [23] THE REQUISITES OF UNIFORMITY AND THE ATTRIBUTION OR AUTHORSHIP
    PULIDO, M
    [J]. MEDICINA CLINICA, 1994, 103 (16): : 638 - 638
  • [24] Estimating the Probability of an Authorship Attribution
    Savoy, Jacques
    [J]. JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2016, 67 (06) : 1462 - 1472
  • [25] Future trends in authorship attribution
    Juola, Patrick
    [J]. ADVANCES IN DIGITAL FORENSIC III, 2007, 242 : 119 - 132
  • [26] Authorship Attribution of Scientific Abstracts
    Suman, Chanchal
    Saha, Sriparna
    Bhattacharyya, Pushpak
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1522 - 1528
  • [27] Authorship Attribution Using Entropy
    Grabchak, M.
    Zhang, Z.
    Zhang, D. T.
    [J]. JOURNAL OF QUANTITATIVE LINGUISTICS, 2013, 20 (04) : 301 - 313
  • [28] Authorship Attribution of Android Apps
    Gonzalez, Hugo
    Stakhanova, Natalia
    Ghorbani, Ali A.
    [J]. PROCEEDINGS OF THE EIGHTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY (CODASPY'18), 2018, : 277 - 286
  • [29] Authorship Attribution of Arabic Tweets
    Rabab'ah, Abdullateef
    Al-Ayyoub, Mahmoud
    Jararweh, Yaser
    Aldwairi, Monther
    [J]. 2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [30] Authorship Attribution in Arabic Poetry
    Ahmed, Alfalahi
    Mohamed, Ramdani
    Mostafa, Bellafkih
    [J]. 2016 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA), 2016,