Word sense disambiguation of Thai language with unsupervised learning

被引:0
|
作者
Pongpinigpinyo, S [1 ]
Rivepiboon, W [1 ]
机构
[1] Chulalongkorn Univ, Fac Engn, Dept Comp Engn, Bangkok 10330, Thailand
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many approach strategies can be employed to resolve word sense ambiguity with a reasonable degree of accuracy. These strategies are: knowledge-based, corpus-based, and hybrid-based. This paper pays attention to the corpus-based strategy that employs an unsupervised learning method for disambiguation. We report our investigation of Latent Semantic Indexing (LSI), an unsupervised learning, to the task of Thai noun and verbal word sense disambiguation. We report experiments on two Thai polysemous words, namely (SIC)/hua4/ and (SIC)/kep1/ that are used as a representative of Thai nouns and verbs respectively. The results of these experiments demonstrate the effectiveness and indicate the potential of applying vector-based distributional information measures to semantic disambiguation. Our approach performs better than a baseline system, which picks the most frequent sense.
引用
收藏
页码:1275 / 1283
页数:9
相关论文
共 50 条
  • [31] Unsupervised bilingual word sense disambiguation using Web statistics
    Wang, Y
    Hoffmann, A
    [J]. AI 2005: ADVANCES IN ARTIFICIAL INTELLIGENCE, 2005, 3809 : 1167 - 1172
  • [32] Combining unsupervised lexical knowledge methods for word sense disambiguation
    Rigau, G
    Atserias, J
    Agirre, E
    [J]. 35TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 8TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, 1997, : 48 - 55
  • [33] Unsupervised word sense disambiguation with N-gram features
    Daniel Preotiuc-Pietro
    Florentina Hristea
    [J]. Artificial Intelligence Review, 2014, 41 : 241 - 260
  • [34] An Experimental Study of Graph Connectivity for Unsupervised Word Sense Disambiguation
    Navigli, Roberto
    Lapata, Mirella
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (04) : 678 - 692
  • [35] WORD SENSE DISAMBIGUATION: A STRUCTURED LEARNING PERSPECTIVE
    Zhou, Yun
    Wang, Ting
    Wang, Zhiyuan
    [J]. COMPUTING AND INFORMATICS, 2015, 34 (06) : 1257 - 1288
  • [36] Word-Sense Disambiguation of Korean Predicates using Sejong Electronic Dictionary and Unsupervised learning
    Kang, Sangwook
    Oh, Yeontaek
    Kim, Minho
    Kwon, Hyuk-chul
    [J]. CIT/IUCC/DASC/PICOM 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY - UBIQUITOUS COMPUTING AND COMMUNICATIONS - DEPENDABLE, AUTONOMIC AND SECURE COMPUTING - PERVASIVE INTELLIGENCE AND COMPUTING, 2015, : 257 - 261
  • [37] An improved word sense disambiguation method for Chinese full-words based on unsupervised learning
    Li X.
    Liu G.-H.
    Zhang D.-M.
    [J]. Zidonghua Xuebao/ Acta Automatica Sinica, 2010, 36 (01): : 184 - 187
  • [38] A Literature Survey on Word Sense Disambiguation for the Hindi Language
    Gujjar, Vinto
    Mago, Neeru
    Kumari, Raj
    Patel, Shrikant
    Chintalapudi, Nalini
    Battineni, Gopi
    [J]. INFORMATION, 2023, 14 (09)
  • [39] Word sense disambiguation using heterogeneous language resources
    Shirai, K
    Tamagaki, T
    [J]. NATURAL LANGUAGE PROCESSING - IJCNLP 2004, 2005, 3248 : 377 - 385
  • [40] Word Sense Disambiguation of Polysemy Words in Kannada Language
    Shashank, N. S.
    Kallimani, Jagadish S.
    [J]. 2017 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2017, : 641 - 644