Study on Automatic Extraction Method of Tibetan New Words

被引:0
|
作者
Sun, Yuan [1 ]
Yan, Xiaodong [1 ]
Zhao, Xiaobing [1 ]
Yang, Guosheng [1 ]
机构
[1] Minzu Univ China, Sch Informat Engn, Beijing 100081, Peoples R China
关键词
Tibetan new words; extraction; dynamic Tibetan corpus;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a model to automatically extract Tibetan new words. Through building the dynamic Tibetan corpus from 2009 to 2012, which covers more than 18 Tibetan network media of Tibet, Qinghai, Sichuan, Gansu and Yunnan, we research on the key techniques of Tibetan new word extraction: (1) using statistical method to establish Tibetan new words knowledge base; (2) using information entropy and vector space module similarity calculation to extract/filter Tibetan new valid words; (3) using word co-occurrence techniques to extract Tibetan new meaning words.
引用
收藏
页码:130 / 133
页数:4
相关论文
共 50 条
  • [1] A New Automatic Extraction Method for Glaciers on the Tibetan Plateau under Clouds, Shadows and Snow Cover
    Hu, Mingcheng
    Zhou, Guangsheng
    Lv, Xiaomin
    Zhou, Li
    He, Xiaohui
    Tian, Zhihui
    REMOTE SENSING, 2022, 14 (13)
  • [2] The Fractal Patterns of Words in a Text: A Method for Automatic Keyword Extraction
    Najafi, Elham
    Darooneh, Amir H.
    PLOS ONE, 2015, 10 (06):
  • [3] A Method of Automatic Metadata Extraction Corresponding to the Impression by Sound of the Words
    Graduate School of Systems and Information Engineering, University of Tsukuba, Japan
    不详
    Front. Artif. Intell. Appl., 1600, (206-222):
  • [4] A bootstrap method for chinese new words extraction
    He, S
    Zhu, J
    2001 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-VI, PROCEEDINGS: VOL I: SPEECH PROCESSING 1; VOL II: SPEECH PROCESSING 2 IND TECHNOL TRACK DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS NEURALNETWORKS FOR SIGNAL PROCESSING; VOL III: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING MULTIMEDIA SIGNAL PROCESSING - VOL IV: SIGNAL PROCESSING FOR COMMUNICATIONS; VOL V: SIGNAL PROCESSING EDUCATION SENSOR ARRAY & MULTICHANNEL SIGNAL PROCESSING AUDIO & ELECTROACOUSTICS; VOL VI: SIGNAL PROCESSING THEORY & METHODS STUDENT FORUM, 2001, : 581 - 584
  • [5] Automatic new word extraction method
    Shi, Q
    Shen, LQ
    Chai, HX
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 865 - 868
  • [6] Study on Tibetan New Meaning Word Extraction
    Yuan, Sun
    2013 2ND INTERNATIONAL SYMPOSIUM ON INSTRUMENTATION AND MEASUREMENT, SENSOR NETWORK AND AUTOMATION (IMSNA), 2013, : 404 - 407
  • [7] A Study on Automatic Extraction of New Terms
    Zhang, Xing
    Fang, Alex Chengyu
    PROCEEDINGS OF THE FIRST NORTHEAST ASIA INTERNATIONAL SYMPOSIUM ON LANGUAGE, LITERATURE AND TRANSLATION, 2011, : 48 - 55
  • [8] New automatic liver segmentation and extraction method
    Zhang, Pinzheng
    Xu, Qinzheng
    Wang, Zheng
    MIPPR 2007: MEDICAL IMAGING, PARALLEL PROCESSING OF IMAGES, AND OPTIMIZATION TECHNIQUES, 2007, 6789
  • [9] Automatic Extraction Method of Hot Words Based on Agricultural Network Information Classification
    Duan Q.
    Zhang L.
    Liu Y.
    Wang S.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2018, 49 (07): : 160 - 167
  • [10] Research on the Extraction Technology of Hot-words in Tibetan WebPages
    Wang, Chang-Zhi
    Xu, Gui-Xian
    Wang, Hui
    3RD ANNUAL INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY AND APPLICATIONS (ITA 2016), 2016, 7