Authorship Attribution Using Text Distortion

被引:0
|
作者
Stamatatos, Efstathios [1 ]
机构
[1] Univ Aegean, Samos 83200, Greece
关键词
GENRE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Authorship attribution is associated with important applications in forensics and humanities research. A crucial point in this field is to quantify the personal style of writing, ideally in a way that is not affected by changes in topic or genre. In this paper, we present a novel method that enhances authorship attribution effectiveness by introducing a text distortion step before extracting stylometric measures. The proposed method attempts to mask topic-specific information that is not related to the personal style of authors. Based on experiments on two main tasks in authorship attribution, closed-set attribution and authorship verification, we demonstrate that the proposed approach can enhance existing methods especially under cross-topic conditions, where the training and test corpora do not match in topic.
引用
收藏
页码:1138 / 1149
页数:12
相关论文
共 50 条
  • [1] A review on authorship attribution in text mining
    Zheng, Wanwan
    Jin, Mingzhe
    [J]. WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2023, 15 (02)
  • [2] Authorship Attribution for Neural Text Generation
    Uchendu, Adaku
    Le, Thai
    Shu, Kai
    Lee, Dongwon
    [J]. PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 8384 - 8395
  • [3] Data mining of text as a tool in authorship attribution
    Visa, A
    Toivonen, J
    Autio, S
    Mäkinen, J
    Back, B
    Vanharanta, H
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS AND TECHNOLOGY III, 2001, 4384 : 149 - 156
  • [4] Selecting text features relevant for authorship attribution
    Zoya, Rezanova, I
    Alexandr, Romanov S.
    Roman, Meshcheryakov, V
    [J]. VESTNIK TOMSKOGO GOSUDARSTVENNOGO UNIVERSITETA FILOLOGIYA-TOMSK STATE UNIVERSITY JOURNAL OF PHILOLOGY, 2013, 26 (06): : 38 - 52
  • [5] Stopword Graphs and Authorship Attribution in Text Corpora
    Arun, R.
    Suresh, V.
    Madhavan, C. E. Veni
    [J]. 2009 IEEE THIRD INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2009), 2009, : 192 - 196
  • [6] Text Categorization for Authorship Attribution in English Poetry
    Gallagher, Catherine
    Li, Yanjun
    [J]. INTELLIGENT COMPUTING, VOL 1, 2019, 858 : 249 - 261
  • [7] Authorship Attribution on Kannada Text using Bi-Directional LSTM Technique
    Chandrika, C. P.
    Kallimani, Jagadish S.
    [J]. INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (09) : 963 - 971
  • [8] Text Documents Encoding Through Images for Authorship Attribution
    Lichtblau, Daniel
    Stoean, Catalin
    [J]. STATISTICAL LANGUAGE AND SPEECH PROCESSING, SLSP 2018, 2018, 11171 : 178 - 189
  • [9] Evolving a Weighted Combination of Text Similarities for Authorship Attribution
    Keyrouz, Youssef
    Fonlupt, Cyril
    Mezher, Dany
    Robilliard, Denis
    Faddoul, Rafic
    [J]. ARTIFICIAL EVOLUTION, EA 2019, 2020, 12052 : 13 - 27
  • [10] Authorship Attribution Using Entropy
    Grabchak, M.
    Zhang, Z.
    Zhang, D. T.
    [J]. JOURNAL OF QUANTITATIVE LINGUISTICS, 2013, 20 (04) : 301 - 313