Study on Automatic Extraction Method of Tibetan New Words

被引:0
|
作者
Sun, Yuan [1 ]
Yan, Xiaodong [1 ]
Zhao, Xiaobing [1 ]
Yang, Guosheng [1 ]
机构
[1] Minzu Univ China, Sch Informat Engn, Beijing 100081, Peoples R China
关键词
Tibetan new words; extraction; dynamic Tibetan corpus;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a model to automatically extract Tibetan new words. Through building the dynamic Tibetan corpus from 2009 to 2012, which covers more than 18 Tibetan network media of Tibet, Qinghai, Sichuan, Gansu and Yunnan, we research on the key techniques of Tibetan new word extraction: (1) using statistical method to establish Tibetan new words knowledge base; (2) using information entropy and vector space module similarity calculation to extract/filter Tibetan new valid words; (3) using word co-occurrence techniques to extract Tibetan new meaning words.
引用
收藏
页码:130 / 133
页数:4
相关论文
共 50 条
  • [31] A New Method for Automatic Glacier Extraction by Building Decision Trees Based on Pixel Statistics
    Liu, Xiao
    Cheng, Hongyi
    Liu, Jiang
    Su, Xianbao
    Wang, Yuchen
    Qiao, Bin
    Wang, Yipeng
    Wang, Nai'ang
    REMOTE SENSING, 2025, 17 (04)
  • [32] A New Method for Automatic Extraction and Analysis of Discontinuities Based on TIN on Rock Mass Surfaces
    Wu, Xiang
    Wang, Fengyan
    Wang, Mingchang
    Zhang, Xuqing
    Wang, Qing
    Zhang, Shuo
    REMOTE SENSING, 2021, 13 (15)
  • [33] A NEW FEATURE SELECTION METHOD BASED ON CONCEPT EXTRACTION IN AUTOMATIC CHINESE TEXT CLASSIFICATION
    Liao, Shasha
    Jiang, Minghu
    NEW MATHEMATICS AND NATURAL COMPUTATION, 2007, 3 (03) : 331 - 347
  • [34] A study on the Early Processing of the Tones of Monosyllabic Words in Tibetan Lhasa Dialect
    Xu, Na
    Wang, Jing
    Yu, Hongzhi
    2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND BUSINESS ANALYTICS (ICDSBA 2018), 2018, : 273 - 277
  • [35] A Method of Automatic Translation of Words of Multiple Affixes in Scientific Literature
    Wang, Lei
    Chang, Baobao
    Harkness, J.
    11TH CHINESE LEXICAL SEMANTICS WORKSHOP (CKSW2010), 2010, : 387 - 394
  • [36] A study on the association of basic color words among Tibetan College Students
    Fang, Yanhong
    Yin, Guanhai
    Liu, Lihua
    INTERNATIONAL JOURNAL OF PSYCHOLOGY, 2023, 58 : 319 - 319
  • [37] A geometric method for automatic extraction of sulcal fundi
    Kao, C. -Y.
    Hofer, M.
    Sapiro, G.
    Stern, J.
    Rottenberg, D. A.
    2006 3RD IEEE INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING: MACRO TO NANO, VOLS 1-3, 2006, : 1168 - +
  • [38] An Automatic Extraction Method of Surveillance Visual Context
    Liang, Haozhe
    Xu, Shukui
    Li, Guohui
    INTELLIGENCE COMPUTATION AND EVOLUTIONARY COMPUTATION, 2013, 180 : 733 - 741
  • [39] Study on Tibetan-Chinese Comparable Corpus Extraction
    Sun, Yuan
    Guo, Li-li
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND COMPUTER SCIENCE (AICS 2016), 2016, : 287 - 293
  • [40] Research on automatic extraction method of thermal discharge
    Zhu, Xiu-Fang
    Li, Yuan
    Zhongguo Huanjing Kexue/China Environmental Science, 2023, 43 (11): : 6115 - 6122