A case study in authorship attribution: The Mondrigo

被引:0
|
作者
Sierra, Gerardo [1 ]
Hernández-García, Tonatiuh [1 ]
Gómez-Adorno, Helena [2 ]
Bel-Enguix, Gemma [1 ]
机构
[1] Instituto de Ingeniería, Universidad Nacional Autónoma de México, Ciudad de México, Mexico
[2] Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Ciudad de México, Mexico
来源
关键词
K-means clustering;
D O I
10.3233/JIFS-219236
中图分类号
学科分类号
摘要
In this paper, we present authorship attribution methods applied to ¡El Mondrigo! (1968), a controversial text supposedly created by order of the Mexican Government to defame a student strike. Up to now, although the authorship of the book has been attributed to several journalists and writers, it could not be demonstrated and remains an open problem. The work aims at establishing which one of the most commonly attributed writers is the real author. To do that, we implement methods based on stylometric features using textual distance, supervised, and unsupervised learning. The distance-based methods implemented in this work are Kilgarriff and Delta of Burrows, an SVM algorithm is used as the supervised method, and the k-means algorithm as the unsupervised algorithm. The applied methods were consistent by pointing out a single author as the most likely one. © 2022 - IOS Press. All rights reserved.
引用
收藏
页码:4473 / 4480
相关论文
共 50 条
  • [1] Authorship attribution: The case of Oliver Goldsmith
    Mannion, D
    Dixon, P
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES D-THE STATISTICIAN, 1997, 46 (01) : 1 - 18
  • [2] AUTHORSHIP ATTRIBUTION
    HOLMES, DI
    [J]. COMPUTERS AND THE HUMANITIES, 1994, 28 (02): : 87 - 106
  • [3] FRACTIONAL COUNTS FOR AUTHORSHIP ATTRIBUTION - A NUMERICAL STUDY
    BURRELL, Q
    ROUSSEAU, R
    [J]. JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE, 1995, 46 (02): : 97 - 102
  • [4] Versification and Authorship Attribution
    Gomez Camelo, Laura Camila
    Munoz Landinez, Valeria
    [J]. LITERATURA-TEORIA HISTORIA CRITICA, 2023, 25 (02): : 308 - 315
  • [5] Authorship attribution in the wild
    Moshe Koppel
    Jonathan Schler
    Shlomo Argamon
    [J]. Language Resources and Evaluation, 2011, 45 : 83 - 94
  • [6] Championing authorship attribution
    不详
    [J]. NATURE CELL BIOLOGY, 2017, 19 (06) : 579 - 579
  • [7] Authorship Attribution and Pastiche
    Harold Somers
    Fiona Tweedie
    [J]. Computers and the Humanities, 2003, 37 : 407 - 429
  • [8] Authorship Attribution System
    Marchenko, Oleksandr
    Anisimov, Anatoly
    Nykonenko, Andrii
    Rossada, Tetiana
    Melnikov, Egor
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, NLDB 2017, 2017, 10260 : 227 - 231
  • [9] Authorship attribution and pastiche
    Somers, H
    Tweedie, F
    [J]. COMPUTERS AND THE HUMANITIES, 2003, 37 (04): : 407 - 429
  • [10] The diary of a public man: a case study in traditional and non-traditional authorship attribution
    Holmes, David I.
    Crofts, Daniel W.
    [J]. LITERARY AND LINGUISTIC COMPUTING, 2010, 25 (02): : 179 - 197