PRINCIPAL COMPONENT ANALYSIS FOR AUTHORSHIP ATTRIBUTION

Authors
Jamak, Amir [1]
Savatic, Alen [1]
Can, Mehmet [1]
Affiliation
[1] Int Univ Sarajevo, Fac Engn & Nat Sci, Hrasnicka Cesta 15, Sarajevo 71000, Bosnia & Herceg
Keywords
principal components; authorship attribution; stylometry; text categorization; function words; classification task; stylistic features; syntactic characteristics
DOI
Not available
Chinese Library Classification
C93 [Management Science]; O22 [Operations Research]
Subject classification codes
070105; 12; 1201; 1202; 120202
Abstract
A common problem in statistical pattern recognition is that of feature selection or feature extraction. Feature selection refers to a process whereby a data space is transformed into a feature space that, in theory, has exactly the same dimension as the original data space. However, the transformation is designed in such a way that the data set may be represented by a reduced number of "effective" features and yet retain most of the intrinsic information content of the data; in other words, the data set undergoes a dimensionality reduction. In this paper, data collected by counting words and characters in around a thousand paragraphs of each sample book undergo a principal component analysis performed using neural networks. The first principal component is then used to distinguish the books written by a given author.
Pages: 189-196
Page count: 8
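
The abstract describes extracting the first principal component with a neural network. The record does not say which network or learning rule the authors used, so the following is only a minimal sketch under stated assumptions: it uses Oja's rule (a single linear neuron with Hebbian learning, a standard neural formulation of PCA) and compares the result with an SVD-based reference. The function names, learning rate, epoch count, and the synthetic "paragraph feature" data are all hypothetical and are not taken from the paper.

```python
import numpy as np

def first_pc_oja(X, lr=0.001, epochs=50, seed=0):
    """Estimate the first principal direction with Oja's rule.

    Assumption: a single linear neuron y = w . x trained with
    w <- w + lr * y * (x - y * w). This is one common neural PCA rule,
    not necessarily the one used in the paper.
    """
    rng = np.random.default_rng(seed)
    Xc = X - X.mean(axis=0)               # centre each feature
    w = rng.normal(size=Xc.shape[1])
    w /= np.linalg.norm(w)                # start from a random unit vector
    for _ in range(epochs):
        for x in Xc[rng.permutation(len(Xc))]:
            y = w @ x                     # neuron output
            w += lr * y * (x - y * w)     # Hebbian term with built-in decay
    return w / np.linalg.norm(w)

def first_pc_svd(X):
    """Reference first principal direction from the SVD of the centred data."""
    Xc = X - X.mean(axis=0)
    _, _, vt = np.linalg.svd(Xc, full_matrices=False)
    return vt[0]

# Hypothetical data: 200 "paragraphs" with 3 count-like features that
# share one dominant latent factor plus a little noise.
rng = np.random.default_rng(42)
latent = rng.normal(size=(200, 1))
X = latent @ np.array([[3.0, 1.5, 0.5]]) + 0.3 * rng.normal(size=(200, 3))

w_oja = first_pc_oja(X)
w_svd = first_pc_svd(X)

# The sign of a principal direction is arbitrary, so compare |cosine|.
alignment = abs(w_oja @ w_svd)

# Projection of each paragraph onto the first principal component.
scores = (X - X.mean(axis=0)) @ w_oja
print(f"alignment with SVD direction: {alignment:.4f}")
```

In an attribution setting, `scores` would play the role of the single derived feature per paragraph; how those scores are then thresholded or compared across authors is not specified in the abstract and is left out here.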