Genetic Optimization of Keyword Subsets in the Classification Analysis of Authorship of Texts

Times cited: 0
Author
Pavlyshenko, Bohdan [1]
Affiliation
[1] Ivan Franko Lviv Natl Univ, UA-79005 Lvov, Ukraine
DOI
10.1080/09296174.2014.944329
Chinese Library Classification (CLC) number
H0 [Linguistics];
Discipline classification codes
030303; 0501; 050102;
Abstract
This paper analyses the genetic selection of keyword sets whose frequencies in the texts serve as attributes for text classification. Genetic optimization was performed on a set of words forming the part of a frequency dictionary that falls within given frequency limits. The frequency dictionary was built from a corpus of English fiction. The error of a k-nearest-neighbours classifier served as the fitness function minimized by the genetic algorithm. The results show high precision and recall in classifying texts by author when the attributes are keyword sets selected by the genetic algorithm from the frequency dictionary.
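The idea in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the toy corpus, the function names (`make_toy_corpus`, `knn_loo_error`, `genetic_select`), the leave-one-out error estimate, the truncation selection with elitism, and all parameter values are assumptions made for illustration. Each candidate keyword subset is a bit mask over the vocabulary, and its fitness is the error of a k-nearest-neighbours classifier that uses only the selected keyword frequencies.

```python
import random
from collections import Counter


def make_toy_corpus(seed=1, n_docs=20, n_words=10):
    """Hypothetical data: rows are documents, columns are keyword frequencies.
    Only keywords 0 and 1 carry an authorship signal; the rest are noise."""
    rng = random.Random(seed)
    X, y = [], []
    for d in range(n_docs):
        author = d % 2
        row = [rng.random() for _ in range(n_words)]
        row[0] += 2.0 * author
        row[1] += 2.0 * (1 - author)
        X.append(row)
        y.append(author)
    return X, y


def knn_loo_error(X, y, mask, k=3):
    """Leave-one-out error of a k-NN classifier restricted to the keywords
    switched on in `mask` (squared Euclidean distance, majority vote)."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 1.0  # an empty keyword set is treated as the worst case
    errors = 0
    for i, xi in enumerate(X):
        dists = sorted(
            (sum((xi[j] - xo[j]) ** 2 for j in cols), y[o])
            for o, xo in enumerate(X)
            if o != i
        )
        votes = Counter(label for _, label in dists[:k])
        if votes.most_common(1)[0][0] != y[i]:
            errors += 1
    return errors / len(X)


def genetic_select(X, y, pop_size=20, generations=15, p_mut=0.05, k=3, seed=0):
    """Search for a keyword mask that minimizes the k-NN error."""
    rng = random.Random(seed)
    n = len(X[0])

    def fitness(mask):
        return knn_loo_error(X, y, mask, k)

    pop = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
    history = []  # best error seen in each generation
    for _ in range(generations):
        pop.sort(key=fitness)
        history.append(fitness(pop[0]))
        parents = pop[: pop_size // 2]  # truncation selection
        children = [pop[0][:]]  # elitism: keep the best mask unchanged
        while len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)  # one-point crossover
            child = a[:cut] + b[cut:]
            # Bit-flip mutation with probability p_mut per keyword.
            child = [bit ^ (rng.random() < p_mut) for bit in child]
            children.append(child)
        pop = children
    best = min(pop, key=fitness)
    history.append(fitness(best))
    return best, history


if __name__ == "__main__":
    X, y = make_toy_corpus()
    best_mask, history = genetic_select(X, y)
    print("selected keyword indices:", [j for j, b in enumerate(best_mask) if b])
    print("best error per generation:", history)
```

Because the best mask of each generation is copied unchanged into the next, the recorded error never increases. On a real corpus the columns would be frequencies of words drawn from the frequency dictionary within the chosen limits, and the error would normally be estimated on held-out texts rather than by leave-one-out on a toy sample.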
Pages: 341-349
Number of pages: 9