Evolving Character-Level DenseNet Architectures Using Genetic Programming

Cited by: 4
Authors
Londt, Trevor [1]
Gao, Xiaoying [1]
Andreae, Peter [1]
Affiliations
[1] Victoria Univ Wellington, Sch Engn & Comp Sci, Wellington, New Zealand
Keywords
Character-level DenseNet; Evolutionary deep learning; Genetic programming; Text classification;
DOI
10.1007/978-3-030-72699-7_42
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Densely Connected Convolutional Networks (DenseNet) have demonstrated impressive performance on image classification tasks, but limited research has been conducted on using character-level DenseNet (char-DenseNet) architectures for text classification tasks. It is not clear which DenseNet architectures are optimal for text classification. The iterative process of designing, training and testing char-DenseNets is time-consuming and requires expert domain knowledge. Evolutionary deep learning (EDL) has been used to automatically design CNN architectures for the image classification domain, thereby mitigating the need for expert domain knowledge. This study presents the first work on using EDL to evolve char-DenseNet architectures for text classification tasks. A novel genetic programming-based algorithm (GP-Dense), coupled with an indirect-encoding scheme, facilitates the evolution of performant char-DenseNet architectures. The algorithm is evaluated on two popular text datasets, and the best-evolved models are benchmarked against four current state-of-the-art character-level CNN and DenseNet models. Results indicate that the algorithm evolves performant models for both datasets that outperform two of the state-of-the-art models in terms of model accuracy and three of the state-of-the-art models in terms of parameter size.
Pages: 665-680
Page count: 16