Recentred local profiles for authorship attribution

Cited by: 17
Authors
Layton, Robert [1]
Watters, Paul [1]
Dazeley, Richard [2]
Affiliations
[1] Univ Ballarat, Internet Commerce Secur Lab, Ballarat, Vic 3353, Australia
[2] Univ Ballarat, Data Min & Informat Res Grp, Ballarat, Vic 3353, Australia
Keywords
STYLE; IDENTIFICATION
DOI
10.1017/S1351324911000180
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Authorship attribution methods aim to determine the author of a document by using information gathered from a set of documents with known authors. One method of performing this task is to create profiles containing distinctive features known to be used by each author. In this paper, a new method of creating an author or document profile is presented that detects features considered distinctive compared to normal language usage. This recentring approach creates more accurate profiles than previous methods, as demonstrated empirically using a known corpus of authorship problems. The method, named recentred local profiles, determines authorship accurately relative to other methods in the literature while using a simple 'best matching author' approach to classification. The proposed method is shown to be more stable than related methods as parameter values change. Using a weighted voting scheme, recentred local profiles is shown to outperform other methods in authorship attribution, with an overall accuracy of 69.9% on the Ad-hoc Authorship Attribution Competition corpus, representing a significant improvement over related methods.
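The abstract describes profiles built from features that are distinctive relative to normal language usage, matched with a 'best matching author' rule. The following is only a minimal sketch of that general idea, not the paper's implementation: the abstract does not name the feature type, the baseline, or the distance measure. Here character n-grams, a baseline pooled from the known-author texts, and a cosine-based distance are all assumptions, and every function name is hypothetical. The weighted voting scheme mentioned in the abstract is not covered.

```python
from collections import Counter
import math


def ngram_freqs(text, n=3):
    """Relative frequency of each character n-gram in `text` (assumed feature type)."""
    grams = [text[i:i + n] for i in range(len(text) - n + 1)]
    total = len(grams)
    if total == 0:
        return {}
    return {g: c / total for g, c in Counter(grams).items()}


def recentred_profile(text, baseline, n=3, size=50):
    """Keep the `size` n-grams whose frequency deviates most from the baseline.

    Each kept value is (frequency in text) - (baseline frequency), so the
    profile is centred on "normal" usage rather than on zero.
    """
    freqs = ngram_freqs(text, n)
    grams = set(freqs) | set(baseline)
    diffs = {g: freqs.get(g, 0.0) - baseline.get(g, 0.0) for g in grams}
    # Sort by absolute deviation; ties are broken alphabetically for determinism.
    top = sorted(diffs, key=lambda g: (-abs(diffs[g]), g))[:size]
    return {g: diffs[g] for g in top}


def profile_distance(p, q):
    """1 - cosine similarity between two sparse profiles (assumed distance)."""
    dot = sum(v * q.get(g, 0.0) for g, v in p.items())
    norm = math.sqrt(sum(v * v for v in p.values())) * math.sqrt(
        sum(v * v for v in q.values())
    )
    return 1.0 - dot / norm if norm else 1.0


def best_matching_author(doc, author_texts, n=3, size=50):
    """Return the author whose recentred profile is closest to the document's."""
    # Assumption: "normal language usage" is estimated from all known texts pooled.
    baseline = ngram_freqs(" ".join(author_texts.values()), n)
    doc_profile = recentred_profile(doc, baseline, n, size)
    distances = {
        author: profile_distance(
            doc_profile, recentred_profile(text, baseline, n, size)
        )
        for author, text in author_texts.items()
    }
    return min(distances, key=distances.get)
```

A call such as `best_matching_author(unknown_text, {"author_a": text_a, "author_b": text_b})` returns the key of the closest author. The profile size and n-gram length are the kind of parameters whose stability the abstract refers to, although the values used here are arbitrary.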
Pages: 293-312
Page count: 20