Evolving Character-Level DenseNet Architectures Using Genetic Programming

Cited by: 4
Authors
Londt, Trevor [1 ]
Gao, Xiaoying [1 ]
Andreae, Peter [1 ]
Affiliations
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington, New Zealand
Keywords
Character-level DenseNet; Evolutionary deep learning; Genetic programming; Text classification;
DOI
10.1007/978-3-030-72699-7_42
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Densely Connected Convolutional Networks (DenseNet) have demonstrated impressive performance on image classification tasks, but limited research has been conducted on using character-level DenseNet (char-DenseNet) architectures for text classification tasks. It is not clear which DenseNet architectures are optimal for text classification. The iterative process of designing, training and testing char-DenseNets is time-consuming and requires expert domain knowledge. Evolutionary deep learning (EDL) has been used to automatically design CNN architectures for the image classification domain, thereby mitigating the need for expert domain knowledge. This study presents the first work on using EDL to evolve char-DenseNet architectures for text classification tasks. A novel genetic programming-based algorithm (GP-Dense), coupled with an indirect-encoding scheme, facilitates the evolution of performant char-DenseNet architectures. The algorithm is evaluated on two popular text datasets, and the best-evolved models are benchmarked against four current state-of-the-art character-level CNN and DenseNet models. Results indicate that the algorithm evolves performant models for both datasets that outperform two of the state-of-the-art models in terms of model accuracy and three of the state-of-the-art models in terms of parameter size.
Pages: 665 - 680
Page count: 16
Related papers
50 in total
  • [21] Standardizing Tweets with Character-Level Machine Translation
    Ljubesic, Nikola
    Erjavec, Tomaz
    Fiser, Darja
    COMPUTATIONAL LINGUISTICS AND INTELLIGENT TEXT PROCESSING, CICLING 2014, PART II, 2014, 8404 : 164 - 175
  • [22] Character-level HyperNetworks for Hate Speech Detection
    Wullach, Tomer
    Adler, Amir
    Minkov, Einat
    EXPERT SYSTEMS WITH APPLICATIONS, 2022, 205
  • [23] Generalized Character-Level Spelling Error Correction
    Farra, Noura
    Tomeh, Nadi
    Rozovskaya, Alla
    Habash, Nizar
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2014, : 161 - 167
  • [24] Word Game Modeling Using Character-Level N-Gram and Statistics
    Mattiev, Jamolbek
    Salaev, Ulugbek
    Kavsek, Branko
    MATHEMATICS, 2023, 11 (06)
  • [25] Malicious and Benign URL Dataset Generation Using Character-Level LSTM Models
    Vecile, Spencer
    Lacroix, Kyle
    Grolinger, Katarina
    Samarabandu, Jagath
    2022 5TH IEEE CONFERENCE ON DEPENDABLE AND SECURE COMPUTING (IEEE DSC 2022), 2022,
  • [26] A Multilingual and Multidomain Study on Dialog Act Recognition Using Character-Level Tokenization
    Ribeiro, Eugenio
    Ribeiro, Ricardo
    de Matos, David Martins
    INFORMATION, 2019, 10 (03)
  • [27] CHARACTER-LEVEL EMBEDDING USING FASTTEXT AND LSTM FOR BIOMEDICAL NAMED ENTITY RECOGNITION
    Al-Jumaili, Ahmed Sabah Ahmed
    Tayyeh, Huda Kadhim
    SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2024, 25 (06) : 5258 - 5264
  • [28] Evolving quantum circuits using genetic programming
    Rubinstein, BIP
    PROCEEDINGS OF THE 2001 CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1 AND 2, 2001, : 144 - 151
  • [29] Chinese Morphological Analysis with Character-level POS Tagging
    Shen, Mo
    Liu, Hongxiao
    Kawahara, Daisuke
    Kurohashi, Sadao
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2014, : 253 - 258
  • [30] Text steganography: a novel character-level embedding algorithm using font attribute
    Ramakrishnan, Bala Krishnan
    Thandra, Prasanth Kumar
    Srinivasula, A. V. Satya Murty
    SECURITY AND COMMUNICATION NETWORKS, 2016, 9 (18) : 6066 - 6079