PRINCIPAL COMPONENT ANALYSIS FOR AUTHORSHIP ATTRIBUTION

Authors
Jamak, Amir [1]
Savatic, Alen [1]
Can, Mehmet [1]
Affiliation
[1] Int Univ Sarajevo, Fac Engn & Nat Sci, Hrasnicka Cesta 15, Sarajevo 71000, Bosnia & Herceg
Keywords
principal components; authorship attribution; stylometry; text categorization; function words; classification task; stylistic features; syntactic characteristics;
DOI
Not available
Chinese Library Classification
C93 [Management Science]; O22 [Operations Research];
Subject classification codes
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
Abstract
A common problem in statistical pattern recognition is that of feature selection or feature extraction. Feature selection refers to a process whereby a data space is transformed into a feature space that, in theory, has exactly the same dimension as the original data space. However, the transformation is designed so that the data set can be represented by a reduced number of "effective" features while retaining most of the intrinsic information content of the data; in other words, the data set undergoes a dimensionality reduction. In this paper, the data collected by counting words and characters in around a thousand paragraphs of each sample book underwent a principal component analysis performed using neural networks. The first principal component is then used to distinguish the books written by a particular author.
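The abstract does not say which neural network was used for the principal component analysis. As a minimal sketch only, the block below assumes a single linear neuron trained with Oja's Hebbian rule, a standard neural formulation whose weight vector tends toward the first principal direction. The function name `first_pc_oja`, the learning rate, the epoch count, and the layout of the input matrix (one row per paragraph, one column per word or character count) are all illustrative assumptions, not details taken from the paper.

```python
import numpy as np

def first_pc_oja(X, lr=0.001, epochs=20, seed=0):
    """Estimate the first principal direction of X with Oja's rule.

    X is assumed to hold one row per paragraph and one column per
    count feature (hypothetical layout, not specified in the abstract).
    Returns the unit weight vector and the projection of each row on it.
    """
    rng = np.random.default_rng(seed)
    Xc = X - X.mean(axis=0)              # center each feature
    w = rng.normal(size=Xc.shape[1])     # random initial weights
    w /= np.linalg.norm(w)
    for _ in range(epochs):
        for x in Xc[rng.permutation(len(Xc))]:
            y = w @ x                    # output of the single linear neuron
            w += lr * y * (x - y * w)    # Oja's update: Hebbian term with decay
    w /= np.linalg.norm(w)
    scores = Xc @ w                      # first principal component scores
    return w, scores
```

Under these assumptions, the per-paragraph `scores` could be summarized per book (for example by their mean) and compared across authors. The learning rate has to be small relative to the largest feature variance, so raw counts may need rescaling before training.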
Pages: 189-196
Page count: 8