Online text prediction with recurrent neural networks

被引:5
|
作者
Pérez-Ortiz, JA [1 ]
Calera-Rubio, J [1 ]
Forcada, ML [1 ]
机构
[1] Univ Alacant, Dept Llenguatges & sistemes Informat, E-03071 Alacant, Spain
关键词
arithmetic coding; online nonlinear prediction; recurrent neural networks; text compression;
D O I
10.1023/A:1012491324276
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Arithmetic coding is one of the most outstanding techniques for lossless data compression. It attains its good performance with the help of a probability model which indicates at each step the probability of occurrence of each possible input symbol given the current context. The better this model, the greater the compression ratio achieved. This work analyses the use of discrete-time recurrent neural networks and their capability for predicting the next symbol in a sequence in order to implement that model. The focus of this study is on online prediction, a task much harder than the classical offline grammatical inference with neural networks. The results obtained show that recurrent neural networks have no problem when the sequences come from the output of a finite-state machine, easily giving high compression ratios. When compressing real texts, however, the dynamics of the sequences seem to be too complex to be learned online correctly by the net.
引用
收藏
页码:127 / 140
页数:14
相关论文
共 50 条
  • [1] Online Text Prediction with Recurrent Neural Networks
    Juan Antonio Pérez-Ortiz
    Jorge Calera-Rubio
    Mikel L. Forcada
    [J]. Neural Processing Letters, 2001, 14 : 127 - 140
  • [2] Recurrent Neural Networks for Online Video Popularity Prediction
    Trzcinski, Tomasz
    Andruszkiewicz, Pawel
    Bochenski, Tomasz
    Rokita, Przemyslaw
    [J]. FOUNDATIONS OF INTELLIGENT SYSTEMS, ISMIS 2017, 2017, 10352 : 146 - 153
  • [3] Text/Non-Text Classification in Online Handwritten Documents with Recurrent Neural Networks
    Truyen Van Phan
    Nakagawa, Masaki
    [J]. 2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, : 23 - 28
  • [4] Robust Online Time Series Prediction with Recurrent Neural Networks
    Guo, Tian
    Xu, Zhao
    Yao, Xin
    Chen, Haifeng
    Aberer, Karl
    Funaya, Koichi
    [J]. PROCEEDINGS OF 3RD IEEE/ACM INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS, (DSAA 2016), 2016, : 816 - 825
  • [5] An Improved Segmentation of Online English Handwritten Text Using Recurrent Neural Networks
    Cuong Tuan Nguyen
    Nakagawa, Masaki
    [J]. PROCEEDINGS 3RD IAPR ASIAN CONFERENCE ON PATTERN RECOGNITION ACPR 2015, 2015, : 176 - 180
  • [6] Proficiency Prediction System for Online Learning Based on Recurrent Neural Networks
    Huang, You-Xuan
    Huang, Nen-Fu
    Tzeng, Jiang -Wei
    Liang, James
    Su, Ching -Wei
    Li, Yao-Ting
    [J]. 2022 9TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE, ISCMI, 2022, : 74 - 78
  • [7] Line-Break Prediction of Hanmun Text using Recurrent Neural Networks
    Oh, Dong Hoon
    Shah, Zahra
    Jang, Gil-Jin
    [J]. 2017 INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY CONVERGENCE (ICTC), 2017, : 720 - 724
  • [8] Recurrent Convolutional Neural Networks for Text Classification
    Lai, Siwei
    Xu, Liheng
    Liu, Kang
    Zhao, Jun
    [J]. PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2267 - 2273
  • [9] Convolutional Recurrent Neural Networks for Text Classification
    Wang, Ruishuang
    Li, Zhao
    Cao, Jian
    Chen, Tong
    Wang, Lei
    [J]. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [10] Recurrent Graph Neural Networks for Text Classification
    Wei, Xinde
    Huang, Hai
    Ma, Longxuan
    Yang, Ze
    Xu, Liutong
    [J]. PROCEEDINGS OF 2020 IEEE 11TH INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING AND SERVICE SCIENCE (ICSESS 2020), 2020, : 91 - 97