Text generation by probabilistic suffix tree language model

被引:0
|
作者
Marukatat, Sanparith [1 ]
机构
[1] NECTEC, Image Proc & Understanding Team, AI Res Grp, Bangkok, Thailand
关键词
Thai NLP; language model; probabilistic suffix tree;
D O I
10.1109/iSAI-NLP54397.2021.9678167
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
During last decade, language modeling has been dominated by neural structures; RNN, LSTM or Transformer. These neural language models provide excellent performance to the detriment of very high computational cost. This work investigates the use of probabilistic language model that requires much less computational cost. In particular, we are interested in variable-order Markov model that can be efficiently implemented on a probabilistic suffix tree (PST) structure. The PST construction is cheap and can be easily scaled to very large dataset. Experimental results show that this model can be used to generated realistic sentences.
引用
收藏
页数:4
相关论文
共 50 条
  • [1] A Text Similarity Measure Based on Suffix Tree
    Huang, Chenghui
    Liu, Yan
    Xia, Shengzhong
    Yin, Jian
    [J]. INFORMATION-AN INTERNATIONAL INTERDISCIPLINARY JOURNAL, 2011, 14 (02): : 583 - 592
  • [2] A Finite Context Intrusion Prediction Model for Cloud Systems with a Probabilistic Suffix Tree
    Kholidy, Hisham A.
    Yousof, Ahmed M.
    Erradi, Abdelkarim
    Abdelwahed, Sherif
    Ali, Hisham Arafat
    [J]. UKSIM-AMSS EIGHTH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS 2014), 2014, : 526 - 531
  • [3] A Suffix Tree Or Not a Suffix Tree?
    Starikovskaya, Tatiana
    Vildhoj, Hjalte Wedel
    [J]. COMBINATORIAL ALGORITHMS, IWOCA 2014, 2015, 8986 : 338 - 350
  • [4] (sic)-generAItor: Tree-in-the-loop Text Generation for Language Model Explainability and Adaptation
    Spinner, Thilo
    Kehlbeck, Rebecca
    Sevastjanova, Rita
    Staehle, Tobias
    Keim, Daniel A.
    Deussen, Oliver
    El-Assady, Mennatallah
    [J]. ACM TRANSACTIONS ON INTERACTIVE INTELLIGENT SYSTEMS, 2024, 14 (02)
  • [5] A suffix tree or not a suffix tree?
    Starikovskaya, Tatiana
    Vildhoj, Hjalte Wedel
    [J]. JOURNAL OF DISCRETE ALGORITHMS, 2015, 32 : 14 - 23
  • [6] ANNOTATED SUFFIX TREE AS A WAY OF TEXT REPRESENTATION FOR INFORMATION RETRIEVAL IN TEXT COLLECTIONS
    Frolov, Dmitry S.
    [J]. BIZNES INFORMATIKA-BUSINESS INFORMATICS, 2015, 34 (04): : 63 - 70
  • [7] Using Annotated Suffix Tree Similarity Measure for Text Summarisation
    Yakovlev, Maxim
    Chernyak, Ekaterina
    [J]. ANALYSIS OF LARGE AND COMPLEX DATA, 2016, : 103 - 112
  • [8] An Improved Text Retrieval Algorithm Based on Suffix Tree Similarity Measure
    Huang, Cheng-hui
    Yin, Jian
    Han, Dong
    [J]. INFORMATION COMPUTING AND APPLICATIONS, PT 2, 2010, 106 : 150 - +
  • [9] METHOD OF ANNOTATED SUFFIX TREE FOR SCORING THE EXTENT OF PRESENCE OF A STRING IN TEXT
    Mirkin, B.
    Chernyak, E.
    Chugunova, O.
    [J]. BIZNES INFORMATIKA-BUSINESS INFORMATICS, 2012, 21 (03): : 31 - +
  • [10] Suffix cactus: A cross between suffix tree and suffix array
    Karkkainen, J
    [J]. COMBINATORIAL PATTERN MATCHING, 1995, 937 : 191 - 204