PRINCIPAL COMPONENT ANALYSIS FOR AUTHORSHIP ATTRIBUTION

被引:0
|
作者
Jamak, Amir [1 ]
Savatic, Alen [1 ]
Can, Mehmet [1 ]
机构
[1] Int Univ Sarajevo, Fac Engn & Nat Sci, Hrasnicka Cesta 15, Sarajevo 71000, Bosnia & Herceg
关键词
principal components; authorship attribution; stylometry; text categorization; function words; classification task; stylistic features; syntactic characteristics;
D O I
暂无
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
A common problem in statistical pattern recognition is that of feature selection or feature extraction. Feature selection refers to a process whereby a data space is transformed into a feature space that, in theory, has exactly the same dimension as the original data space. However, the transformation is designed in such a way that the data set may be represented by a reduced number of "effective" features and yet retain most of the intrinsic information content of the data; in other words, the data set undergoes a dimensionality reduction. In this paper the data collected by counting words and characters in around a thousand paragraphs of each sample book underwent a principal component analysis performed using heural networks. Then first of the principal components is used to distinguished the books authored by a certain author.
引用
收藏
页码:189 / 196
页数:8
相关论文
共 50 条
  • [41] Authorship Attribution in Arabic Poetry
    Ahmed, Alfalahi
    Mohamed, Ramdani
    Mostafa, Bellafkih
    2016 11TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS: THEORIES AND APPLICATIONS (SITA), 2016,
  • [42] Authorship Attribution Using Entropy
    Grabchak, M.
    Zhang, Z.
    Zhang, D. T.
    JOURNAL OF QUANTITATIVE LINGUISTICS, 2013, 20 (04) : 301 - 313
  • [43] Authorship Attribution of Android Apps
    Gonzalez, Hugo
    Stakhanova, Natalia
    Ghorbani, Ali A.
    PROCEEDINGS OF THE EIGHTH ACM CONFERENCE ON DATA AND APPLICATION SECURITY AND PRIVACY (CODASPY'18), 2018, : 277 - 286
  • [44] Authorship Attribution of Arabic Tweets
    Rabab'ah, Abdullateef
    Al-Ayyoub, Mahmoud
    Jararweh, Yaser
    Aldwairi, Monther
    2016 IEEE/ACS 13TH INTERNATIONAL CONFERENCE OF COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2016,
  • [45] Authorship Attribution of Scientific Abstracts
    Suman, Chanchal
    Saha, Sriparna
    Bhattacharyya, Pushpak
    2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 1522 - 1528
  • [46] A New Approach for Authorship Attribution
    Reddy, P. Buddha
    Reddy, T. Raghunadha
    Chand, M. Gopi
    Venkannababu, A.
    INFORMATION AND DECISION SCIENCES, 2018, 701 : 1 - 9
  • [47] Estimating the Probability of an Authorship Attribution
    Savoy, Jacques
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2016, 67 (06) : 1462 - 1472
  • [48] THE REQUISITES OF UNIFORMITY AND THE ATTRIBUTION OR AUTHORSHIP
    PULIDO, M
    MEDICINA CLINICA, 1994, 103 (16): : 638 - 638
  • [49] Computational Methods in Authorship Attribution
    Koppel, Moshe
    Schler, Jonathan
    Argamon, Shlorno
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (01): : 9 - 26
  • [50] An example of mathematical authorship attribution
    Basile, Chiara
    Benedetto, Dario
    Caglioti, Emanuele
    Esposti, Mirko Degli
    JOURNAL OF MATHEMATICAL PHYSICS, 2008, 49 (12)