Automatic new word extraction method

被引:0
|
作者
Shi, Q
Shen, LQ
Chai, HX
机构
关键词
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
New words are very difficult to be extracted automatically for those languages where there is no word boundary in written texts, such as Chinese, Japanese etc. In this paper, we present a statistical method to extract new words from a large amount of corpus with no word boundary. Based on Generalized Suffix Tree (GST) data structure we define NWP (New Word Pattern) and SBP (Segmentation Boundary Pattern) to separate input strings into small pieces, and offer a practical and efficient algorithm to get the proper words from GST.
引用
收藏
页码:865 / 868
页数:4
相关论文
共 50 条
  • [31] Modelling Word Similarity. An Evaluation of Automatic Synonymy Extraction Algorithms
    Heylen, Kris
    Peirsman, Yves
    Geeraerts, Dirk
    Speelman, Dirk
    SIXTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, LREC 2008, 2008, : 3243 - 3249
  • [32] Automatic tag recommendation approach with keyphrase extraction and word embedding techniques
    Konkaew, Taechawat
    Kitisin, Sukumal
    Journal of Computers (Taiwan), 2019, 30 (02) : 135 - 149
  • [33] A Study on Automatic Extraction of New Terms
    Zhang, Xing
    Fang, Alex Chengyu
    PROCEEDINGS OF THE FIRST NORTHEAST ASIA INTERNATIONAL SYMPOSIUM ON LANGUAGE, LITERATURE AND TRANSLATION, 2011, : 48 - 55
  • [34] A new automatic concavity extraction model
    Sirakov, Nikolay Metodiev
    Simonelli, Italo
    7TH IEEE SOUTHWEST SYMPOSIUM ON IMAGE ANALYSIS AND INTERPRETATION, 2006, : 178 - 182
  • [35] New techniques for relevant word ranking and extraction
    Ventura, Joao
    Da Silva, Joaquim Ferreira
    PROGRESS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4874 : 691 - 702
  • [36] Study on Tibetan New Meaning Word Extraction
    Yuan, Sun
    2013 2ND INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION AND MEASUREMENT, SENSOR NETWORK AND AUTOMATION (IMSNA), 2013, : 404 - 407
  • [37] Concerning the method of fat determination III Announcement - Extraction apparatus with new coolers for the automatic recoupment of the extraction medium
    Zinzadze, SR
    BIOCHEMISCHE ZEITSCHRIFT, 1930, 220 : 185 - 191
  • [38] Two-Word Collocation Extraction Using Monolingual Word Alignment Method
    Liu, Zhanyi
    Wang, Haifeng
    Wu, Hua
    Li, Sheng
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2012, 3 (01)
  • [39] A New Method for Automatic Glacier Extraction by Building Decision Trees Based on Pixel Statistics
    Liu, Xiao
    Cheng, Hongyi
    Liu, Jiang
    Su, Xianbao
    Wang, Yuchen
    Qiao, Bin
    Wang, Yipeng
    Wang, Nai'ang
    REMOTE SENSING, 2025, 17 (04)
  • [40] A New Method for Automatic Extraction and Analysis of Discontinuities Based on TIN on Rock Mass Surfaces
    Wu, Xiang
    Wang, Fengyan
    Wang, Mingchang
    Zhang, Xuqing
    Wang, Qing
    Zhang, Shuo
    REMOTE SENSING, 2021, 13 (15)