Word sense disambiguation of Thai language with unsupervised learning

Authors
Pongpinigpinyo, S [1]
Rivepiboon, W [1]
Affiliations
[1] Chulalongkorn Univ, Fac Engn, Dept Comp Engn, Bangkok 10330, Thailand
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Word sense ambiguity can be resolved with a reasonable degree of accuracy using three broad strategies: knowledge-based, corpus-based, and hybrid. This paper focuses on the corpus-based strategy, using an unsupervised learning method for disambiguation. We investigate Latent Semantic Indexing (LSI), an unsupervised learning technique, for the task of Thai noun and verb sense disambiguation. We report experiments on two Thai polysemous words, (SIC)/hua4/ and (SIC)/kep1/, which serve as representatives of Thai nouns and verbs respectively. The results demonstrate the effectiveness, and indicate the potential, of applying vector-based distributional information measures to semantic disambiguation. Our approach performs better than a baseline system that picks the most frequent sense.
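The abstract does not describe the authors' pipeline in detail, so the following is only a minimal sketch of the general idea, assuming a plain term-by-context count matrix, a truncated SVD as the LSI step, and cosine similarity between the reduced context vectors. The English toy contexts and the function names (`lsi_context_vectors`, `cosine`, `most_frequent_sense`) are hypothetical illustrations, not taken from the paper. The last function shows the most-frequent-sense baseline that the abstract compares against.

```python
from collections import Counter

import numpy as np


def lsi_context_vectors(contexts, k=2):
    """Map each context (a whitespace-tokenized string) to a k-dimensional LSI vector."""
    vocab = sorted({w for c in contexts for w in c.split()})
    index = {w: i for i, w in enumerate(vocab)}
    # Term-by-context count matrix: rows are words, columns are contexts.
    counts = np.zeros((len(vocab), len(contexts)))
    for j, c in enumerate(contexts):
        for w in c.split():
            counts[index[w], j] += 1.0
    # Thin SVD; keeping the top k singular triplets is the LSI reduction.
    _, s, vt = np.linalg.svd(counts, full_matrices=False)
    k = min(k, len(s))
    # Row j holds the coordinates of context j in the reduced space.
    return vt[:k].T * s[:k]


def cosine(u, v):
    """Cosine similarity, defined as 0.0 when either vector is zero."""
    denom = np.linalg.norm(u) * np.linalg.norm(v)
    return float(u @ v / denom) if denom > 0 else 0.0


def most_frequent_sense(labels):
    """Baseline: always predict the sense seen most often in the labeled data."""
    return Counter(labels).most_common(1)[0][0]


# Hypothetical toy contexts around one ambiguous word (the word itself removed).
contexts = [
    "river water shore",
    "river shore fishing",
    "money loan deposit",
    "money deposit account",
]
vectors = lsi_context_vectors(contexts, k=2)
same_sense = cosine(vectors[0], vectors[1])   # two "river" contexts
cross_sense = cosine(vectors[0], vectors[2])  # "river" vs. "money" context
```

In this toy setup, contexts that share vocabulary end up close together in the reduced space while unrelated contexts do not, which is the property an unsupervised system could exploit to group occurrences by sense. A real system would also need Thai word segmentation and a term-weighting scheme, neither of which is specified in the abstract.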
Pages: 1275-1283
Number of pages: 9