Authorship Attribution and Pastiche

被引:0
|
作者
Harold Somers
Fiona Tweedie
机构
[1] Centre for Computational Linguistics,Department of Statistics
[2] UMIST,undefined
[3] University of Glasgow,undefined
来源
关键词
authorship; lexical richness; Lewis Carroll; pastiche;
D O I
暂无
中图分类号
学科分类号
摘要
This paper considers the question of authorship attribution techniques whenfaced with a pastiche. We ask whether the techniques can distinguish the real thing from the fake, or can the author fool the computer? If the latter, is this because the pastiche is good, or because the technique is faulty? Using a number of mainly vocabulary-based techniques, Gilbert Adair's pastiche of Lewis Carroll, Alice Through the Needle's Eye, is compared with the original `Alice' books. Standard measures of lexical richness, Yule's K andOrlov's Z both distinguish Adair from Carroll, though Z also distinguishesthe two originals. A principal component analysis based on word frequenciesfinds that the main differences are not due to authorship. A discriminantanalysis based on word usage and lexical richness successfully distinguishes thepastiche from the originals. Weighted cusum tests were also unable to distinguish the two authors in a majority of cases. As a cross-validation, wemade similar comparisons with control texts: another children's story from thesame era, and other work by Carroll and Adair. The implications of thesefindings are discussed.
引用
收藏
页码:407 / 429
页数:22
相关论文
共 50 条
  • [31] Authorship attribution of texts: A review
    Malyutov, M. B.
    [J]. GENERAL THEORY OF INFORMATION TRANSFER AND COMBINATORICS, 2006, 4123 : 362 - 380
  • [32] Bucketed common vector scaling for authorship attribution in heterogeneous web collections: A scaling approach for authorship attribution
    Agun, Hayri Volkan
    Yilmazel, Ozgur
    [J]. JOURNAL OF INFORMATION SCIENCE, 2020, 46 (05) : 683 - 695
  • [33] Authorship in ecology: attribution, accountability, and responsibility
    Weltzin, Jake F.
    Belote, R. Travis
    Williams, Leigh T.
    Keller, Jason K.
    Engel, E. Cayenne
    [J]. FRONTIERS IN ECOLOGY AND THE ENVIRONMENT, 2006, 4 (08) : 435 - 441
  • [34] Authorship Attribution Using Syntactic Dependencies
    Soler-Company, Juan
    Wanner, Leo
    [J]. ARTIFICIAL INTELLIGENCE RESEARCH AND DEVELOPMENT, 2016, 288 : 303 - 308
  • [35] Authorship attribution via combination of evidence
    Zhao, Ying
    Vines, Phil
    [J]. ADVANCES IN INFORMATION RETRIEVAL, 2007, 4425 : 661 - +
  • [36] Authorship Attribution of Social Media Messages
    Theophilo, Antonio
    Giot, Romain
    Rocha, Anderson
    [J]. IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2023, 10 (01) : 10 - 23
  • [37] Search for the real McCoy: Authorship Attribution
    Asesh, Aishwarya
    [J]. 2020 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2020, : 2016 - 2023
  • [38] Is word length inaccurate for authorship attribution?
    Zheng, Wanwan
    Jin, Mingzhe
    [J]. DIGITAL SCHOLARSHIP IN THE HUMANITIES, 2023, 38 (02) : 875 - 890
  • [39] Authorship attribution using relative compression
    Pinho, Armando J.
    Pratas, Diogo
    Ferreira, Paulo J. S. G.
    [J]. 2016 DATA COMPRESSION CONFERENCE (DCC), 2016, : 329 - 338
  • [40] Authorship attribution using word sequences
    Coyotl-Morales, Rosa Maria
    Villasenor-Pineda, Luis
    Montes-y-Gomez, Manuel
    Rosso, Paolo
    [J]. PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS AND APPLICATIONS, PROCEEDINGS, 2006, 4225 : 844 - 853