Author Identification Using Chaos Game Representation and Deep Learning

被引:6
|
作者
Stoean, Catalin [1 ,2 ]
Lichtblau, Daniel [3 ]
机构
[1] Univ Bucharest, Human Language Technol Res Ctr, Bucharest 010014, Romania
[2] Univ Malaga, Grp Ingn Sistemas Integrados, ETSI Telecomunicac, Malaga 29071, Spain
[3] Wolfram Res, Champaign, IL 61820 USA
关键词
authorship attribution; chaos game representation; deep learning; ATTRIBUTION;
D O I
10.3390/math8111933
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
An author unconsciously encodes in the written text a certain style that is often difficult to recognize. Still, there are many computational means developed for this purpose that take into account various features, from lexical and character-based attributes to syntactic or semantic ones. We propose an approach that starts from the character level and uses chaos game representation to illustrate documents like images which are subsequently classified by a deep learning algorithm. The experiments are made on three data sets and the outputs are comparable to the results from the literature. The study also verifies the suitability of the method for small data sets and whether image augmentation can improve the classification efficiency.
引用
收藏
页码:1 / 19
页数:18
相关论文
共 50 条
  • [1] Deep learning on chaos game representation for proteins
    Loechel, Hannah F.
    Eger, Dominic
    Sperlea, Theodor
    Heider, Dominik
    [J]. BIOINFORMATICS, 2020, 36 (01) : 272 - 279
  • [2] Author Identification using Deep Learning
    Mohsen, Ahmed M.
    El-Makky, Nagwa M.
    Ghanem, Nagia
    [J]. 2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 898 - 903
  • [3] Accurate and fast clade assignment via deep learning and frequency chaos game representation
    Avila Cartes, Jorge
    Anand, Santosh
    Ciccolella, Simone
    Bonizzoni, Paola
    Della Vedova, Gianluca
    [J]. GIGASCIENCE, 2022, 12
  • [4] Accurate and fast clade assignment via deep learning and frequency chaos game representation
    Avila Cartes, Jorge
    Anand, Santosh
    Ciccolella, Simone
    Bonizzoni, Paola
    Della Vedova, Gianluca
    [J]. GIGASCIENCE, 2023, 12
  • [5] Classification Experiments of DNA Sequences by Using a Deep Neural Network and Chaos Game Representation
    Rizzo, Riccardo
    Fiannaca, Antonino
    La Rosa, Massimo
    Urso, Alfonso
    [J]. COMPUTER SYSTEMS AND TECHNOLOGIES, COMPSYSTECH'16, 2016, : 222 - 228
  • [6] Intelligent Cybersecurity Classification Using Chaos Game Optimization with Deep Learning Model
    Alabdulkreem, Eatedal
    Alotaibi, Saud S.
    Alamgeer, Mohammad
    Marzouk, Radwa
    Hilal, Anwer Mustafa
    Motwakel, Abdelwahed
    Zamani, Abu Sarwar
    Rizwanullah, Mohammed
    [J]. COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 45 (01): : 971 - 983
  • [7] Chaos game representation of proteins
    Basu, S
    Pan, A
    Dutta, C
    Das, J
    [J]. JOURNAL OF MOLECULAR GRAPHICS & MODELLING, 1997, 15 (05): : 279 - 289
  • [8] Chaos Game Representation\ast
    Chan, Eunice Y. S.
    Corless, Robert M.
    [J]. SIAM REVIEW, 2023, 65 (01) : 261 - 290
  • [9] Multi source deep learning method for drug-protein interaction prediction using k-mers and chaos game representation
    Mesrabadi, Hengame Abbasi
    Faez, Karim
    Pirgazi, Jamshid
    [J]. CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2024, 246
  • [10] Detection of Mammalian Coding Sequences Using a Hybrid Approach of Chaos Game Representation and Machine Learning
    Emam, Mohamed
    Ali, Amna
    Abdelrazik, Eman
    Elattar, Mustafa
    El-Hadidi, Mohamed
    [J]. 2020 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2020, : 2949 - 2951