PRINCIPAL COMPONENT ANALYSIS FOR AUTHORSHIP ATTRIBUTION

Authors
Jamak, Amir [1]
Savatic, Alen [1]
Can, Mehmet [1]
Affiliation
[1] Int Univ Sarajevo, Fac Engn & Nat Sci, Hrasnicka Cesta 15, Sarajevo 71000, Bosnia & Herceg
Keywords
principal components; authorship attribution; stylometry; text categorization; function words; classification task; stylistic features; syntactic characteristics;
DOI
Not available
Chinese Library Classification
C93 [Management Science]; O22 [Operations Research];
Discipline classification codes
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
Abstract
A common problem in statistical pattern recognition is that of feature selection or feature extraction. Feature selection refers to a process whereby a data space is transformed into a feature space that, in theory, has exactly the same dimension as the original data space. However, the transformation is designed in such a way that the data set may be represented by a reduced number of "effective" features and yet retain most of the intrinsic information content of the data; in other words, the data set undergoes a dimensionality reduction. In this paper, data collected by counting words and characters in around a thousand paragraphs of each sample book underwent a principal component analysis performed using neural networks. The first principal component is then used to distinguish the books written by a given author.
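The dimensionality reduction described in the abstract can be illustrated with a minimal sketch. It is not the authors' method: the paper computes the principal components with neural networks, whereas this sketch swaps in a plain singular value decomposition from NumPy. The function name `first_principal_component`, the two features (word count and character count per paragraph), and the synthetic data for two hypothetical authors are all assumptions made for illustration.

```python
import numpy as np

def first_principal_component(X):
    """Return (unit direction, per-row scores, explained-variance fraction)
    of the first principal component of X (rows = samples)."""
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)                  # center each feature
    # Rows of Vt are the principal directions, ordered by singular value.
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    direction = Vt[0]
    scores = Xc @ direction                  # projection onto the first component
    explained = s[0] ** 2 / np.sum(s ** 2)   # share of total variance captured
    return direction, scores, explained

# Hypothetical toy data, NOT from the paper: each row is one paragraph,
# columns are [word count, character count].
rng = np.random.default_rng(0)
words_a = rng.normal(60, 5, size=100)        # assumed author A: shorter paragraphs
words_b = rng.normal(110, 5, size=100)       # assumed author B: longer paragraphs
words = np.concatenate([words_a, words_b])
chars = 5.0 * words + rng.normal(0, 10, size=words.size)
X = np.column_stack([words, chars])

direction, scores, explained = first_principal_component(X)
mean_a = scores[:100].mean()
mean_b = scores[100:].mean()
print("direction:", direction)
print("explained variance fraction:", explained)
print("mean score, author A vs. author B:", mean_a, mean_b)
```

In this toy setting the scores of the two synthetic authors fall on opposite sides of zero along the first component, which is the kind of separation the abstract relies on. The sign of the direction returned by an SVD is arbitrary, so only the relative position of the groups is meaningful. Real count features on different scales would likely need standardizing first.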
Pages: 189-196
Number of pages: 8