Character-Level Quantum Mechanical Approach for a Neural Language Model

被引:0
|
作者
Wang, Zhihao [1 ]
Ren, Min [2 ]
Tian, Xiaoyan [3 ]
Liang, Xia [1 ]
机构
[1] Shandong Univ Finance & Econ, Sch Management Sci & Engn, Jinan 250014, Peoples R China
[2] Shandong Univ Finance & Econ, Sch Math & Quantitat Econ, Jinan 250014, Peoples R China
[3] Shandong Police Coll, Jinan 250014, Peoples R China
基金
中国国家自然科学基金;
关键词
Character-level; Quantum theory; Network-in-network; Language model; SEMANTIC ANALYSIS; REPRESENTATIONS;
D O I
10.2991/ijcis.d.191114.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article proposes a character-level neural language model (NLM) that is based on quantum theory. The input of the model is the character-level coding represented by the quantum semantic space model. Our model integrates a convolutional neural network (CNN) that is based on network-in-network (NIN). We assessed the effectiveness of our model through extensive experiments based on the English-language Penn Treebank dataset. The experiments results confirm that the quantum semantic inputs work well for the language models. For example, the PPL of our model is 10%-30% less than the states of the arts, while it keeps the relatively smaller number of parameters (i.e., 6 m). (C) 2019 The Authors. Published by Atlantis Press SARL.
引用
下载
收藏
页码:1613 / 1621
页数:9
相关论文
共 50 条
  • [21] Parameter-Efficient Korean Character-Level Language Modeling
    Cognetta, Marco
    Moon, Sangwhan
    Wolf-Sonkin, Lawrence
    Okazaki, Naoaki
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 2350 - 2356
  • [22] Keyword Extraction with Character-Level Convolutional Neural Tensor Networks
    Lin, Zhe-Li
    Wang, Chuan-Ju
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2019, PT I, 2019, 11439 : 400 - 413
  • [23] DLCEncDec : A Fully Character-level Encoder-Decoder Model for Neural Responding Conversation
    Wu, Sixing
    Li, Ying
    Zhang, Xinyuan
    Wu, Zhonghai
    2018 IEEE 42ND ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1, 2018, : 516 - 521
  • [24] A Character-Level Convolutional Neural Network for Predicting Exploitability of Vulnerability
    Lyu, Jinghui
    Bai, Yude
    Xing, Zhenchang
    Li, Xiaohong
    Ge, Weimin
    2021 INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF SOFTWARE ENGINEERING (TASE 2021), 2021, : 119 - 126
  • [25] Character-Level Transformer-Based Neural Machine Translation
    Banar, Nikolay
    Daelemans, Walter
    Kestemont, Mike
    2020 4TH INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND INFORMATION RETRIEVAL, NLPIR 2020, 2020, : 149 - 156
  • [26] CHARACTER-LEVEL INCREMENTAL SPEECH RECOGNITION WITH RECURRENT NEURAL NETWORKS
    Hwang, Kyuyeon
    Sung, Wonyong
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5335 - 5339
  • [27] Character-level Intrusion Detection Based on Convolutional Neural Networks
    Lin, Steven Z.
    Shi, Yong
    Xue, Zhi
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [28] Hybrid Attention for Chinese Character-Level Neural Machine Translation
    Wang, Feng
    Chen, Wei
    Yang, Zhen
    Xu, Shuang
    Xu, Bo
    NEUROCOMPUTING, 2019, 358 : 44 - 52
  • [29] The Importance of Character-Level Information in an Event Detection Model
    Boros, Emanuela
    Besancon, Romaric
    Ferret, Olivier
    Grau, Brigitte
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2021), 2021, 12801 : 119 - 131
  • [30] Character-level neural network for biomedical named entity recognition
    Gridach, Mourad
    JOURNAL OF BIOMEDICAL INFORMATICS, 2017, 70 : 85 - 91